Date: October 25, 2000

Docket No.: 2786-0140P

Assistant Commissioner for Patents 
Box PATENT APPLICATION 
Washington, D.C. 20231

Sir: 

As authorized by the inventor(s), transmitted herewith for filing 
is a patent application applied for on behalf of the inventor(s)
according to the provisions of 37 CFR 1.41(c). 

Inventor(s): BERNSTEIN, Jeanne
             LEVINE, Zurit

For: VARIANTS OF ALTERNATIVE SPLICING 

Enclosed are: 

X A specification consisting of 61 pages 

X 48 sheet(s) of Formal drawings

Certified copy of Priority Document(s)

X Executed Declaration in accordance with 37 CFR 1.64 will follow 

A verified statement to establish small entity status under 37 CFR 1.9
and 37 CFR 1.27

X Preliminary Amendment 

X Information Sheet 

Information Disclosure Statement, PTO-1449 with reference(s)

X Other: Sequence Listing (52 pages)

The filing fee has been calculated as shown below: 
X The application transmitted herewith is filed in accordance with
37 CFR 1.41(c). The undersigned has been authorized by the
inventor(s) to file the present application. The original duly
executed patent application together with the surcharge will be
forwarded in due course.

X A check in the amount of $1,394.00 to cover the filing fee and
recording fee (if applicable) is enclosed. 

The Government Filing Fee will be paid at the time of completion
of the filing requirement.

Please charge Deposit Account No. 02-2448 in the amount of
$______. A triplicate copy of this transmittal form is enclosed.

X Send Correspondence to: BIRCH, STEWART, KOLASCH & BIRCH, LLP
                          P. O. Box 747
                          Falls Church, Virginia 22040-0747

No fee is enclosed.

If necessary, the Commissioner is hereby authorized in this, 
concurrent, and future replies, to charge payment or credit any 
overpayment to Deposit Account No. 02-2448 for any additional 
fees required under 37 C.F.R. 1.16 or under 37 C.F.R. 1.17; 
particularly, extension of time fees. 
IN THE U.S. PATENT AND TRADEMARK OFFICE

Applicant: BERNSTEIN, Jeanne et al.

Appl. No.: New                    Group:

Filed: October 25, 2000 Examiner: 

For: VARIANTS OF ALTERNATIVE SPLICING 

PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents October 25, 2000 

Washington, DC 20231 

Sir: 

The following preliminary amendments and remarks are 
respectfully submitted in connection with the above-identified 
application. 

AMENDMENTS 

IN THE CLAIMS:

Please amend the claims as follows: 

Claim 5: Line 1, change "Claim 5" to --Claim 4--



REMARKS 

The amendment to the claims is merely to correct a 
typographical error and to place the application into better form 
for examination. Entry of the present amendment and favorable 
action on the above-identified application are respectfully 
requested. 
VARIANTS OF ALTERNATIVE SPLICING



FIELD OF THE INVENTION 

The present invention concerns novel nucleic acid sequences, vectors and 
host cells containing them, amino acid sequences encoded by said sequences, and
antibodies reactive with said amino acid sequences, as well as pharmaceutical 
compositions comprising any of the above. The present invention further concerns 
methods for screening for candidate activators or deactivators utilizing said amino 
acid sequences. 

BACKGROUND OF THE INVENTION 

Alternative splicing (AS) is an important regulatory mechanism in higher
eukaryotes (P.A. Sharp, Cell 77, 805-815 (1994)). It is thought to be one of the
most important mechanisms for differential expression, related to tissue or
development stage specificity. It is known to play a major role in numerous
biological systems, including human antibody responses and sex determination
in Drosophila (S. Stamm, M.Q. Zhang, T.G. Marr and D.M. Helfman, Nucleic
Acids Research 22, 1515-1526 (1994); B. Chabot, Trends Genet. 12, 472-478
(1996); R.E. Breitbart, A. Andreadis, B. Nadal-Ginard, Annu. Rev. Biochem.
56, 467-495 (1987); C.W. Smith, J.G. Patton, B. Nadal-Ginard, Annu. Rev.
Genet. 27, 527-577 (1989)).

Until recently it was commonly believed that alternative splicing existed in
only a small fraction of genes (about 5%). A recent observation based on a
literature survey of known genes revises this conservative figure upward, to an
estimate that at least 30% of human genes are alternatively spliced (M.S.
Gelfand, I. Dubchak, I. Dralyuk and M. Zorn, Nucleic Acids Research 27, 301-302
(1999)). The importance of the actual frequency of this phenomenon lies not only
in the direct impact on the number of proteins created (100,000 human genes, for
example, would be translated to a much higher number of proteins), but also in
the diversity of functionality derived from the process.

Several mechanisms acting at different stages may be held responsible for the
complexity of higher eukaryotes. Those characterized to date include
alternative splicing at the transcription level, RNA editing at the
post-transcriptional level, and post-translational modifications.

GLOSSARY 

In the following description and claims use will be made, at times, of a
variety of terms, and the meaning of such terms as they should be construed in
accordance with the invention is as follows:

"Variant nucleic acid sequence" - the sequence shown in any one of SEQ ID
NO: 1 to SEQ ID NO: 26, sequences having at least 90% identity, preferably 95%
identity (see below), to said sequence, and fragments (see below) of the above
sequences of at least 20 b.p. long. These sequences are sequences coding for novel,
naturally occurring, alternative splice variants of native and known genes. It should
be emphasized that the novel variants of the present invention are naturally
occurring sequences resulting from alternative splicing of genes and not merely
truncated, mutated or fragmented forms of known sequences which are artificially
produced.

"Variant product" - also referred to at times as the "variant protein" or "variant
polypeptide" - is an amino acid sequence encoded by the variant nucleic acid
sequence which is a naturally occurring mRNA sequence obtained as a result of
alternative splicing. The amino acid sequence may be a peptide, a protein, as well
as peptides or proteins having chemically modified amino acids (see below) such as
a glycopeptide or glycoprotein. The variant products are shown in any one of
SEQ ID NO: 27 to SEQ ID NO: 52. The term also includes homologues (see below)
of said sequences in which one or more amino acids has been added, deleted,
substituted (see below) or chemically modified (see below) as well as fragments
(see below) of this sequence having at least 10 amino acids.

"Nucleic acid sequence" - a sequence composed of DNA nucleotides, RNA
nucleotides or a combination of both types, which may include natural nucleotides,
chemically modified nucleotides and synthetic nucleotides.

"Amino acid sequence" - a sequence composed of any one of the 20 naturally
appearing amino acids, amino acids which have been chemically modified (see
below), or composed of synthetic amino acids.

"Fragment of variant nucleic acid sequence" - novel short stretch of nucleic
acid sequences of at least 20 b.p., which does not appear as a continuous stretch
in the original nucleic acid sequence (see below). The fragment may be a
sequence which was previously undescribed in the context of the published RNA
and which affects the amino acid sequence encoded by the known gene. For
example, where the variant nucleic acid includes a sequence which was not included
in the original sequence (for example a sequence which was an intron in the
original sequence) the fragment may contain said additional sequence. The
fragment may also be a region which is not an intron and which was not present in
the original sequence, for example where the variant lacks a non-terminal region
which was present in the original sequence. The two stretches of nucleotides
spanning this region (upstream and downstream) are brought together by splicing
in the variant, but are spaced from each other by the spliced out region in the
original sequence and are thus not continuous in the original sequence. A continuous
stretch of nucleic acids comprising said two spanning stretches of nucleotides is
not present in the original sequence and thus falls under the definition of
fragment.
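
By way of illustration only, the definition of a fragment can be sketched in
Python as a search for 20-nucleotide windows of the variant that do not occur
as a continuous stretch in the original sequence. The function name
novel_fragments and the toy exon sequences are hypothetical and are not taken
from the sequence listing; the sketch assumes simple exact string matching.

```python
def novel_fragments(variant: str, original: str, min_len: int = 20):
    """Return (start, window) for each min_len window of the variant that is
    not a continuous stretch of the original sequence."""
    variant, original = variant.upper(), original.upper()
    hits = []
    for start in range(len(variant) - min_len + 1):
        window = variant[start:start + min_len]
        if window not in original:
            hits.append((start, window))
    return hits


# Hypothetical toy gene: three 24-nucleotide exons; the variant lacks EXON2,
# so EXON1 and EXON3 are brought together at position 24 of the variant.
EXON1 = "ATGGCTAGCTAGGCTTACGATCGA"
EXON2 = "TTTTGGGGCCCCAAAATTTTGGGG"
EXON3 = "CCGATACGGATTCAGCATGCAATG"
original = EXON1 + EXON2 + EXON3
variant = EXON1 + EXON3

hits = novel_fragments(variant, original)
for start, window in hits:
    print(start, window)
```

In this toy case every reported window straddles the EXON1/EXON3 junction,
which corresponds to the exon exclusion situation described above; windows
lying wholly inside one exon also occur in the original and are not reported.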

"Fragments of variant products" - novel amino acid sequences coded by the
"fragment of variant nucleic acid sequence" defined above.

"Homologues of variants" - amino acid sequences of variants in which one or
more amino acids has been added, deleted or replaced. The addition, deletion or
replacement should be in the regions or adjacent to regions where the variant
differs from the original sequence (see below).

"Conservative substitution" - refers to the substitution of an amino acid in one
class by an amino acid of the same class, where a class is defined by common
physicochemical amino acid side chain properties and high substitution
frequencies in homologous proteins found in nature, as determined, for example,
by a standard Dayhoff frequency exchange matrix or BLOSUM matrix. Six
general classes of amino acid side chains have been categorized and include:
Class I (Cys); Class II (Ser, Thr, Pro, Ala, Gly); Class III (Asn, Asp, Gln, Glu);
Class IV (His, Arg, Lys); Class V (Ile, Leu, Val, Met); and Class VI (Phe, Tyr,
Trp). For example, substitution of an Asp for another class III residue such as
Asn, Gln, or Glu, is a conservative substitution.

"Non-conservative substitution" - refers to the substitution of an amino acid in
one class with an amino acid from another class; for example, substitution of an
Ala, a class II residue, with a class III residue such as Asp, Asn, Glu, or Gln.
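
The six side-chain classes listed above can be written as a small lookup
table. The following Python sketch is illustrative only; the names CLASSES
and classify_substitution are hypothetical, and the class membership is
copied from the definition of conservative substitution given above.

```python
# Six side-chain classes, as listed in the definition above.
CLASSES = {
    "I": {"Cys"},
    "II": {"Ser", "Thr", "Pro", "Ala", "Gly"},
    "III": {"Asn", "Asp", "Gln", "Glu"},
    "IV": {"His", "Arg", "Lys"},
    "V": {"Ile", "Leu", "Val", "Met"},
    "VI": {"Phe", "Tyr", "Trp"},
}

# Reverse lookup: three-letter residue code -> class name.
CLASS_OF = {res: name for name, members in CLASSES.items() for res in members}


def classify_substitution(old: str, new: str) -> str:
    """Label a residue replacement as identical, conservative or
    non-conservative according to the six classes."""
    if old == new:
        return "identical"
    if CLASS_OF[old] == CLASS_OF[new]:
        return "conservative"
    return "non-conservative"


print(classify_substitution("Asp", "Asn"))  # both class III
print(classify_substitution("Ala", "Asp"))  # class II versus class III
```

The two calls reproduce the examples given in the definitions: Asp to Asn
stays within class III, whereas Ala to Asp crosses from class II to class III.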

"Chemically modified" - when referring to the product of the invention, means a
product (protein) where at least one of its amino acid residues is modified either by
natural processes, such as processing or other post-translational modifications, or
by chemical modification techniques which are well known in the art. Among
the numerous known modifications, typical but not exclusive examples include:
acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor
formation, covalent attachment of a lipid or lipid derivative, methylation,
myristoylation, pegylation, prenylation, phosphorylation, ubiquitination, or any
similar process.

"Biologically active" - refers to the variant product having some sort of
biological activity, for example, some physiologically measurable effect on target
cells, molecules or tissues.

"Immunologically active" - defines the capability of a natural, recombinant or
synthetic variant product, or any fragment thereof, to induce a specific immune
response in appropriate animals or cells and to bind with specific antibodies.
Thus, for example, an immunologically active fragment of variant product
denotes a fragment which retains some or all of the immunological properties of
the variant product, e.g. can bind specific anti-variant product antibodies or
can elicit an immune response which will generate such antibodies or cause
proliferation of specific immune cells which produce variant.

"Optimal alignment" - is defined as an alignment giving the highest percent
identity score. Such alignment can be performed using a variety of commercially
available sequence analysis programs, such as the local alignment program
LALIGN using a ktup of 1, default parameters and the default PAM. A preferred
alignment is the one performed using the CLUSTAL-W program from
MacVector (TM), operated with an open gap penalty of 10.0, an extended gap
penalty of 0.1, and a BLOSUM similarity matrix. If a gap needs to be inserted
into a first sequence to optimally align it with a second sequence, the percent
identity is calculated using only the residues that are paired with a corresponding
amino acid residue (i.e., the calculation does not consider residues in the second
sequence that are in the "gap" of the first sequence). In case of alignments of
known gene sequences with that of the new variant, the optimal alignment
invariably included aligning the identical parts of both sequences together, then
keeping apart and unaligned the sections of the sequences that differ one from the
other.

"Having at least X% identity" - with respect to two amino acid or nucleic acid
sequences, refers to the percentage of residues that are identical in the
two sequences when the sequences are optimally aligned. Thus, 90% amino acid
sequence identity means that 90% of the amino acids in two or more optimally
aligned polypeptide sequences are identical; however, this definition explicitly
excludes sequences which are 100% identical with the original sequence from
which the variant of the invention was varied.
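
As a non-limiting sketch, the percent identity calculation described above
(counting only residues paired with a corresponding residue, and ignoring
positions that fall in a gap) may be expressed in Python as follows. The
alignment itself is assumed to have been produced beforehand by an alignment
program; the function names, the "-" gap symbol and the example sequences are
assumptions made for illustration, and the exclusion of the original sequence
is implemented here as a plain equality test on the ungapped candidate.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of identical residues among the paired (non-gap) columns of
    two already-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    paired = [(a, b) for a, b in zip(aligned_a, aligned_b)
              if a != gap and b != gap]
    if not paired:
        return 0.0
    matches = sum(1 for a, b in paired if a == b)
    return 100.0 * matches / len(paired)


def has_at_least_identity(aligned_candidate: str, aligned_reference: str,
                          original: str, threshold: float = 90.0) -> bool:
    """Apply the X% test, excluding a candidate that is 100% identical to
    the original sequence from which the variant was varied."""
    if aligned_candidate.replace("-", "") == original:
        return False
    return percent_identity(aligned_candidate, aligned_reference) >= threshold


# Ten columns, one of which is a gap: 9 paired residues, 8 of them identical.
print(percent_identity("MKTAYIAKQR", "MKTA-IAKQK"))
```

The denominator is the number of paired columns rather than the full
alignment length, which is one reading of the gap rule stated in the
definition of optimal alignment.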

"Isolated nucleic acid molecule having a variant nucleic acid sequence" - is a
nucleic acid molecule that includes the coding variant nucleic acid sequence. Said
isolated nucleic acid molecule may include the variant nucleic acid sequence as
an independent insert; may include the variant nucleic acid sequence fused to an
additional coding sequence, encoding together a fusion protein in which the
variant coding sequence is the dominant coding sequence (for example, the
additional coding sequence may code for a signal peptide); may include the variant
nucleic acid sequence in combination with non-coding sequences, e.g., introns or
control elements, such as promoter and terminator elements or 5' and/or 3'
untranslated regions, effective for expression of the coding sequence in a suitable
host; or may be a vector in which the variant protein coding sequence is
heterologous.

"Expression vector" - refers to vectors that have the ability to incorporate and
express heterologous DNA fragments in a foreign cell. Many prokaryotic and
eukaryotic expression vectors are known and/or commercially available.
Selection of appropriate expression vectors is within the knowledge of those
having skill in the art.

"Deletion" - is a change in either nucleotide or amino acid sequence in which
one or more nucleotides or amino acid residues, respectively, are absent.

"Insertion" or "addition" - is that change in a nucleotide or amino acid
sequence which has resulted in the addition of one or more nucleotides or amino
acid residues, respectively, as compared to the naturally occurring sequence.

"Substitution" - replacement of one or more nucleotides or amino acids by
different nucleotides or amino acids, respectively. As regards amino acid
sequences the substitution may be conservative or non-conservative.

"Antibody" - refers to IgG, IgM, IgD, IgA, or IgG antibody. The definition
includes polyclonal antibodies or monoclonal antibodies. This term refers to
whole antibodies or fragments of the antibodies comprising the antigen-binding
domain of the anti-variant product antibodies, e.g. antibodies without the Fc
portion, single chain antibodies, fragments consisting of essentially only the
variable, antigen-binding domain of the antibody, etc.

"Distinguishing antibody" - an antibody capable of binding to the variant product
and not the original amino acid sequence from which it has been varied, or an
antibody capable of binding to the original nucleic acid sequence and not to the
variant product.

"Activator" - as used herein, refers to a molecule which mimics the effect of the
natural variant product or at times even increases or prolongs the duration of the
biological activity of said product, as compared to that induced by the natural
product. The mechanism may be any mechanism known for prolonging
activities of biological molecules, such as binding to receptors; prolonging the
lifetime of the molecules; increasing the activity of the molecules on their target;
increasing the affinity of molecules to their receptor; inhibiting degradation or
proteolysis of the molecules, or mimicking the biological activity of the variants
on their targets, etc. Activators may be polypeptides, nucleic acids,
carbohydrates, lipids, or derivatives thereof, or any other molecules which can
bind to and activate the variant product.

"Deactivator" (or "Inhibitor") - refers to a molecule which modulates the
activity of the variant product in an opposite manner to that of the activator, by
decreasing or shortening the duration of the biological activity of the variant
product. This may be done by any mechanism known to deactivate or inhibit
biological molecules such as block of the receptor, block of active site,
competition on binding site in target, enhancement of degradation, etc.
Deactivators may be polypeptides, nucleic acids, carbohydrates, lipids, or
derivatives thereof, or any other molecules which bind to and modulate the
activity of said product.

"Treating a disease" - refers to administering a therapeutic substance effective
to ameliorate symptoms associated with a disease, to lessen the severity or cure
the disease, or to prevent the disease from occurring.

"Detection" - refers to a method of detection of a disease, disorder, pathological
or normal condition. This term may refer to detection of a predisposition to a
disease as well as for establishing the prognosis of the patient by determining the
severity of the disease.

"Probe" - the variant nucleic acid sequence, or a sequence complementary
therewith, when used to detect presence of other similar sequences in a sample.
The detection is carried out by identification of hybridization complexes between
the probe and the assayed sequence. The probe may be attached to a solid support
or to a detectable label.

"Original sequence" - the amino acid or nucleic acid sequence from which the
variant of the invention has been varied as a result of alternative splicing.

SUMMARY OF THE INVENTION 

The present invention is based on the finding of several novel, naturally 
occurring splice variants, which are naturally occurring sequences obtained by
alternative splicing of known genes. The novel splice variants of the invention are 
not merely truncated forms, fragments or mutations of known genes, but rather 
novel sequences which naturally occur within the body of individuals. 

The term "alternative splicing" in the context of the present invention and
claims refers to: intron inclusion, exon exclusion, addition or deletion of terminal
sequences in the variant as compared to the original sequences, as well as to the
possibility of "intron retention". Intron retention is an intermediate stage in the
processing of RNA transcripts, where prior to production of fully processed mRNA
the intron (naturally spliced in the original sequence) is retained in the variant.
These intermediately processed RNAs may have physiological significance and are
also within the scope of the invention.
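
For illustration only, exon exclusion and intron retention can be modelled in
Python by treating a gene as an ordered list of exon and intron segments and
choosing which segments reach the transcript. The Segment type, the splice
function and the short toy sequences are hypothetical and serve only to make
the terms concrete.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    name: str
    kind: str  # "exon" or "intron"
    seq: str


def splice(gene, skip_exons=frozenset(), retain_introns=frozenset()):
    """Join the segments kept in the transcript: every exon not named in
    skip_exons, plus any intron named in retain_introns."""
    parts = []
    for seg in gene:
        if seg.kind == "exon" and seg.name not in skip_exons:
            parts.append(seg.seq)
        elif seg.kind == "intron" and seg.name in retain_introns:
            parts.append(seg.seq)
    return "".join(parts)


# Hypothetical toy gene with three exons and two introns.
GENE = [
    Segment("E1", "exon", "ATGGCC"),
    Segment("I1", "intron", "GTAAGT"),
    Segment("E2", "exon", "AAACCC"),
    Segment("I2", "intron", "GTGAGT"),
    Segment("E3", "exon", "TTTTAA"),
]

original = splice(GENE)                                # all exons, no introns
exon_excluded = splice(GENE, skip_exons={"E2"})        # exon exclusion
intron_retained = splice(GENE, retain_introns={"I1"})  # intron retention
print(original, exon_excluded, intron_retained)
```

Each call yields a different transcript from the same gene, which is the
sense in which a variant differs from the original sequence.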

The novel variant products of the invention may have the same
physiological activity as the original peptide from which they have been varied
(although perhaps at a different level); may have an opposite physiological activity
from the activity featured by the original peptide from which they are varied; may
have a completely different, unrelated activity to the activity of the original from
which they are varied; or alternatively may have no activity at all and this may lead
to various diseases or pathological conditions. The variants may also differ from
the original sequence by physiological properties not related to their activity such as
their localization, their time and condition of expression, cleavage and degradation
rates, association with other cellular or extracellular compounds and the like.

The novel variants may also serve for detection purposes, i.e. their presence
or level may be indicative of a disease, disorder, pathological or normal condition,
or alternatively the ratio between the level of the variants and the level of the
original peptide from which they were varied, or the ratio to other variants, may be
indicative of a disease, disorder, pathological or normal condition.

For example, for detection purposes, it is possible to establish differential
expression of various variants in various tissues. A certain variant may be
expressed mainly in one tissue, while the original sequence from which it has been
varied, or another variant, may be expressed mainly in another tissue.
Understanding of the distribution of the variants in various tissues may be helpful
in basic research, for understanding the physiological function of the genes, and
may also help in targeting pharmaceuticals or developing pharmaceuticals.

The study of the variants may also be helpful to distinguish various stages in
the life cycles of the same type of cells, which may also be helpful for development
of pharmaceuticals for various pathological conditions in which the cell cycle is
abnormal, notably cancer.

Thus the detection may be by determination of the presence or the level of
expression of the variant within a specific cell population, comparing said presence
or level between various cell types in a tissue, between different tissues and
between individuals.

Thus the present invention provides by its first aspect, a novel isolated
nucleic acid molecule comprising or consisting of any one of the coding sequences
SEQ ID NO: 1 to SEQ ID NO: 26, fragments of said coding sequence having at
least 20 nucleic acids (provided that said fragments are continuous stretches of
nucleotides not present in the original sequence from which the variant was varied),
or a molecule comprising a sequence having at least 90% identity, preferably 95%
identity to SEQ ID NO: 1 to SEQ ID NO: 26, provided that the molecule is not
completely identical to the original sequence from which the variant was varied.

The present invention further provides a protein or polypeptide comprising
or consisting of an amino acid sequence encoded by any of the above nucleic acid
sequences, termed herein "variant product", for example, an amino acid sequence
having the sequence as depicted in any one of SEQ ID NO: 27 to SEQ ID NO: 52,
fragments of the above amino acid sequence having a length of at least 10 amino
acids coded by the above fragments of the nucleic acid sequences, as well as
homologues of the above amino acid sequences in which one or more of the amino
acid residues has been substituted (by conservative or non-conservative
substitution), added, deleted, or chemically modified.

The deletions, insertions and modifications should be in regions, or adjacent
to regions, wherein the variant differs from the original sequence.

For example, where the variant is different from the original sequence by
addition of a short stretch of 10 amino acids, in the terminal or non-terminal
portion of the peptide, the invention also concerns homologues of that variant
where the additional short stretch is altered, for example, it includes only 8
additional amino acids, includes 13 additional amino acids, or it includes 10
additional amino acids, some of them however being conservative or
non-conservative substitutes of the original additional 10 amino acids of the novel
variants. In all cases the changes in the homologue, as compared to the original
sequence, are in the same regions where the variant differs from the original
sequence, or in regions adjacent to said region.

Another example is where the variant lacks a non-terminal region (for
example of 20 amino acids) which is present in the original sequence (due for
example to exon exclusion). The homologues may lack in the same region only 17
amino acids or 23 amino acids. Again the deletion is in the same region where the
variant lacks a sequence as compared to the original sequence, or in a region
adjacent thereto.

It should be appreciated that once the attention of a man versed in the art is
directed to the importance of a specific region, due to the fact that this region
differs in the variant as compared to the original sequence, there is no problem in
derivatizing said specific region by adding to it, deleting from it, or substituting
some amino acids in it. Thus homologues of variants which are derived from the
variant by changes (deletion, addition, substitution) only in said region as well as in
regions adjacent to it are also a part of the present invention. Generally, if the
variant is distinguished from the original sequence by some sort of physiological
activity, then the homologue is distinguished from the original sequence in
essentially the same manner.

The present invention further provides a nucleic acid molecule comprising or
consisting of a sequence which encodes the above amino acid sequences
(including the fragments and homologues of the amino acid sequences). Due to the
degenerate nature of the genetic code, a plurality of alternative nucleic acid
sequences, beyond those depicted in any one of SEQ ID NO: 1 to SEQ ID NO: 26,
can code for the amino acid sequence of the invention. Those alternative nucleic
acid sequences which code for the same amino acid sequences coded by the
sequences SEQ ID NO: 1 to SEQ ID NO: 26 are also an aspect of the present
invention.
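
The degeneracy referred to above can be made concrete with a short Python
sketch that uses the standard genetic code to count, and for short peptides
to enumerate, the nucleic acid sequences coding for one and the same amino
acid sequence. The function names and the two-residue example peptide are
hypothetical; the codon table is the standard code written in the usual
compact TCAG ordering, with "*" marking stop codons.

```python
from itertools import product
from math import prod

# Standard genetic code; codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

AA_TO_CODONS = {}
for codon, aa in CODON_TO_AA.items():
    AA_TO_CODONS.setdefault(aa, []).append(codon)


def translate(dna: str) -> str:
    """Translate a coding DNA sequence codon by codon (one-letter codes)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, usable, 3))


def count_synonymous(peptide: str) -> int:
    """Number of distinct DNA sequences encoding the given peptide."""
    return prod(len(AA_TO_CODONS[aa]) for aa in peptide)


def synonymous_sequences(peptide: str):
    """Enumerate every DNA sequence encoding the peptide (short inputs only)."""
    return ["".join(c) for c in product(*(AA_TO_CODONS[aa] for aa in peptide))]


print(count_synonymous("ML"))      # Met has one codon, Leu has six
print(synonymous_sequences("ML"))
```

Because the count is the product of the per-residue codon counts, it grows
rapidly with peptide length, so enumeration is practical only for short
stretches.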

The present invention further provides expression vectors and cloning
vectors comprising any of the above nucleic acid sequences, as well as host cells
transfected by said vectors.

The present invention still further provides pharmaceutical compositions
comprising, as an active ingredient, said nucleic acid molecules, said expression
vectors, or said protein or polypeptide.

These pharmaceutical compositions are suitable for the treatment of diseases
and pathological conditions, which can be ameliorated or cured by raising the level
of any one of the variant products of the invention.

In the following a list will be given of diseases or pathological conditions 
which may be treated or detected by sequences of the invention (both nucleic acids, 
complementary and amino acids). As will be evident, the disease may be treated or 
detected also by antibodies which bind specifically to the amino acid sequences: 

SEQ ID NO: 1 and sequences coded thereby, as well as antibodies, may be
used for regulation (increase or decrease) of cytotoxicity induced by gram-negative
bacteria as well as for detection of conditions of improper regulation. They may be
used to combat existing infection by gram-negative bacteria.

SEQ ID NO: 2 and 18, sequences coded thereby and antibodies may be used
for treatment and detection of diseases associated with faulty fat and vitamin
absorption as well as for various digestive diseases.

SEQ ID NO: 3, sequences coded thereby and antibodies may be used for the
treatment and detection of disease involving faulty (too much or too little) blood
coagulation, fibrinolysis or generation of bradykinin and angiotensin. This may be
used for treatment of disease of coagulation such as hemophilia or for treatment of
excess coagulation or in arteriosclerosis.

SEQ ID NOS: 4, 5, 6, 7, 8, 23, 24, 25 and 26, sequences coded thereby and
antibodies can be used for the treatment and detection of diseases or disorders
associated with metalloproteinases (such as collagenases), and for the treatment
and detection of diseases involved in erythroid progenitors.

SEQ ID NO: 9 and 19, sequences coded thereby and antibodies can be used
for detection and treatment of thrombolytic disorders such as in pulmonary
embolism as well as diseases involved with the initiation of fibrinolysis.

SEQ ID NOS: 10 and 11, sequences coded thereby and antibodies can be
used for the treatment and detection of diseases involved with fibrinolysis. The
sequences and antibodies may act as regulators of fibrinolysis.

SEQ ID NOS: 12, 20 and 21, sequences coded thereby and antibodies
can be used for the detection and treatment of connective tissue irregularities and
diseases involved with PDGF cell surface receptors.

SEQ ID NO: 13, 14 and 22, sequences encoded thereby and antibodies may
be used for treatment of tumors, especially tumors due to mutation in tumor
suppressor genes. In addition, they may be used for detection of cancer or
predisposition to cancer. This gene may be associated with tumors in general and
colorectal tumors in particular, as well as with lymphatic and hematogenous
metastasis of esophageal squamous cell carcinoma, and may be used to detect or
treat the disease.

SEQ ID NO: 15, sequences encoded thereby and antibodies may be used for
the detection or treatment of diseases involved with the glutamate receptor in the
brain, such as neurodegenerative disease, epilepsy, psychological disorders and
brain tumors.

SEQ ID NOS: 16 and 17, sequences encoded thereby as well as antibodies
can be used for the treatment or detection of diseases involving regulation of
extracellular fluid volume, mainly hypertension.

By a second aspect, the present invention provides a nucleic acid molecule
comprising or consisting of a non-coding sequence which is complementary to that
of any one of SEQ ID NO: 1 to SEQ ID NO: 26, or complementary to a sequence
having at least 90% identity, preferably 95% identity to said sequence (with the
proviso added above) or a fragment of said two sequences (according to the above
definition of fragment). The complementary sequence may be a DNA sequence
which hybridizes with any one of SEQ ID NO: 1 to SEQ ID NO: 26 or
hybridizes to a portion of that sequence having a length sufficient to inhibit the
transcription of the complementary sequence. The complementary sequence may be
a DNA sequence which can be transcribed into an mRNA being an antisense to the
mRNA transcribed from any one of SEQ ID NO: 1 to SEQ ID NO: 26 or into an
mRNA which is an antisense to a fragment of the mRNA transcribed from any one
of SEQ ID NO: 1 to SEQ ID NO: 26 which has a length sufficient to hybridize with
the mRNA transcribed from SEQ ID NO: 1 to SEQ ID NO: 26, so as to inhibit its
translation. The complementary sequence may also be the mRNA or the fragment
of the mRNA itself.
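
Purely as an illustration of what "complementary" and "antisense" mean at the
sequence level, the following Python sketch derives the reverse complement of
a coding DNA strand and the corresponding antisense RNA. The function names
and the six-nucleotide example are hypothetical; only the four standard bases
are handled.

```python
# Watson-Crick pairing for the four standard DNA bases (either case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def transcribe(dna: str) -> str:
    """Write a DNA strand as RNA by replacing T with U."""
    return dna.upper().replace("T", "U")


def antisense_rna(sense_dna: str) -> str:
    """RNA that can base-pair with the mRNA transcribed from sense_dna."""
    return transcribe(reverse_complement(sense_dna))


sense = "ATGGCC"                    # hypothetical coding-strand fragment
print(transcribe(sense))            # the mRNA:      AUGGCC
print(antisense_rna(sense))         # its antisense: GGCCAU
```

Read in opposite directions, each base of the antisense RNA pairs with the
corresponding base of the mRNA, which is the basis of the hybridization
referred to above.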

The nucleic acids of the second aspect of the invention may be used for
therapeutic or diagnostic applications, for example as probes used for the detection
of the variants of the invention. The presence of the variant transcript or the level of
the variant transcript may be indicative of a multitude of diseases, disorders and
various pathological as well as normal conditions. In addition or alternatively, the
level of the transcripts of the variants of the invention may also be
compared to that of the transcripts of the original sequences from which they have been
varied, or to the level of transcript of other variants, and the resulting ratio may be
indicative of a multitude of diseases, disorders and various pathological and normal
conditions.

The present invention also provides expression vectors comprising any one
of the above defined complementary nucleic acid sequences and host cells
transfected with said nucleic acid sequences or vectors, being complementary to
those specified in the first aspect of the invention.

The invention also provides anti-variant product antibodies, namely
antibodies directed against the variant product which specifically bind to said
variant product. Said antibodies are useful both for diagnostic and therapeutic
purposes. For example, said antibody may be an active ingredient in a
pharmaceutical composition, as will be explained below.

The present invention also provides pharmaceutical compositions
comprising, as an active ingredient, the nucleic acid molecules which comprise or
consist of said complementary sequences, or a vector comprising said
complementary sequences. The invention likewise provides
pharmaceutical compositions comprising, as an active ingredient, said anti-variant
product antibodies.

The pharmaceutical compositions comprising said anti-variant product
antibodies or the nucleic acid molecule comprising said complementary sequence
are suitable for the treatment of diseases and pathological conditions where a
therapeutically beneficial effect may be achieved by neutralizing the variant (either
at the transcript or product level) or decreasing the amount of the variant product or
blocking its binding to its target, for example, by the neutralizing effect of the
antibodies, or by the effect of the antisense mRNA in decreasing the expression
level of the variant sequence.

According to the third aspect of the invention, the present invention provides
methods for detecting the level of the transcript (mRNA) of said variant product in
a body fluid sample, or in a specific tissue sample, for example by use of probes
comprising or consisting of said coding sequences; as well as methods for detecting
levels of expression of said product in tissue, e.g. by the use of antibodies capable
of specifically reacting with the variant products of the invention. Detection of the
level of the expression of the variant of the invention, in particular as compared to
that of the original sequence from which it was varied or compared to other variant
sequences all varied from the same original sequence, may be indicative of a
plurality of physiological or pathological conditions.

The method, according to this latter aspect, for detection of a nucleic acid
sequence which encodes the variant product in a biological sample, comprises the
steps of:

(a) providing a probe comprising at least one of the nucleic acid
sequences defined above;

(b) contacting the biological sample with said probe under conditions
allowing hybridization of nucleic acid sequences thereby enabling formation of
hybridization complexes;

(c) detecting hybridization complexes, wherein the presence of the
complexes indicates the presence of nucleic acid sequence encoding the variant
product in the biological sample.

The method as described above is qualitative, i.e. indicates whether the
transcript is present in or absent from the sample. The method can also be
quantitative, by determining the level of hybridization complexes and then
calibrating said levels to determine levels of transcripts of the desired variant in
the sample.
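
The calibration step mentioned above can be illustrated by a simple standard
curve. The following Python sketch assumes, purely for illustration, that the
hybridization signal is linear in the amount of transcript over the working
range and that standards of known amount are available; the function names and
all numerical values are hypothetical.

```python
def fit_line(amounts, signals):
    """Ordinary least-squares fit of signal = slope * amount + intercept."""
    n = len(amounts)
    mean_x = sum(amounts) / n
    mean_y = sum(signals) / n
    sxx = sum((x - mean_x) ** 2 for x in amounts)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(amounts, signals))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def signal_to_amount(signal, slope, intercept):
    """Invert the standard curve to estimate the transcript amount."""
    return (signal - intercept) / slope


# Hypothetical standards: known transcript amounts and measured signals.
slope, intercept = fit_line([0.0, 1.0, 2.0, 4.0], [5.0, 15.0, 25.0, 45.0])
print(round(signal_to_amount(35.0, slope, intercept), 6))  # 3.0
```

A linear model is only one possible choice; a non-linear calibration may be
more appropriate where the signal saturates.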

Both qualitative and quantitative determination methods can be used for
diagnostic, prognostic and therapy planning purposes.

By a preferred embodiment the probe is part of a nucleic acid chip used for
detection purposes, i.e. the probe is a part of an array of probes each present in a
known location on a solid support.

The nucleic acid sequence used in the above method may be a DNA
sequence, an RNA sequence, etc.; it may be a coding sequence or a sequence
complementary thereto (for respective detection of RNA transcripts or
coding-DNA sequences). By quantification of the level of hybridization complexes
and calibrating the quantified results it is possible also to detect the level of the
transcript in the sample.

Methods for detecting mutations in the region coding for the variant product
are also provided, which may be methods carried out in a binary fashion, namely
merely detecting whether there are any mismatches between the normal variant
nucleic acid sequence of the invention and the one present in the sample, or
carried out by specifically detecting the nature and location of the mutation.

The present invention also concerns a method for detecting variant product
in a biological sample, comprising the steps of:

(a) contacting with said biological sample the antibody of the invention,
thereby forming an antibody-antigen complex; and

(b) detecting said antibody-antigen complex,

wherein the presence of said antibody-antigen complex correlates with the
presence of variant product in said biological sample.

Many diseases are diagnosed by detecting the presence of antibodies against
a protein characterizing the disease in the blood, serum or any other body fluid of
the patient. The present invention also concerns a method for detecting anti-variant
antibody in a biological sample, comprising:

(a) contacting said sample with the variant product of the invention,
thereby forming an antibody-antigen complex; and

(b) detecting said antibody-antigen complex,

wherein the presence of said antibody-antigen complex correlates with the
presence of anti-variant antibody in the sample.

As indicated above, both methods (for detection of variant product and for
detection of the anti-variant antibody) can be quantified to determine the level or
the amount of the variant or antibody in the sample, alone or in comparison to the
level of the original amino acid sequence from which it was varied or compared to
the level of antibodies against the original amino acid sequence, and qualitative and
quantitative results may be used for diagnostic, prognostic and therapy planning
purposes.

The invention also concerns distinguishing antibodies, i.e. antibodies
capable of binding either to the variant product or to the original sequence from
which the variant has been varied, while not binding to the original sequence or the
variant product respectively. These distinguishing antibodies may be used for
detection purposes.

By yet another aspect the invention also provides a method for identifying
candidate compounds capable of binding to the variant product and modulating its
activity (being either activators or deactivators). The method includes:

(i) providing a protein or polypeptide comprising an amino acid
sequence substantially as depicted in any one of SEQ ID NO: 27 to 52, or a
fragment of such a sequence;

(ii) contacting a candidate compound with said amino acid sequence;

(iii) measuring the physiological effect of said candidate compound on
the activity of the amino acid sequences and selecting those compounds which
show a significant effect on said physiological activity.

The present invention also concerns compounds identified by the
methods described above, which compounds may be either an activator of the
variant product or a deactivator thereof.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to understand the invention and to see how it may be carried out in 
practice, a preferred embodiment will now be described, by way of non-limiting 
example only, with reference to the accompanying drawings, in which: 

Fig. 1 is a comparison between the amino acid sequence of SEQ ID NO: 27
and the original sequence from which it has been varied;

Fig. 2 is a comparison between the amino acid sequence of SEQ ID NO: 28
and the original sequence from which it has been varied;

Fig. 3 is a comparison between the amino acid sequence of SEQ ID NO: 29
and the original sequence from which it has been varied;

Fig. 4 is a comparison between the amino acid sequence of SEQ ID NO: 30
and the original sequence from which it has been varied;

Fig. 5 is a comparison between the amino acid sequence of SEQ ID NO: 31
and the original sequence from which it has been varied;

Fig. 6 is a comparison between the amino acid sequence of SEQ ID NO: 32
and the original sequence from which it has been varied;

Fig. 7 is a comparison between the amino acid sequence of SEQ ID NO: 33
and the original sequence from which it has been varied;

Fig. 8 is a comparison between the amino acid sequence of SEQ ID NO: 34
and the original sequence from which it has been varied;

Fig. 9 is a comparison between the amino acid sequence of SEQ ID NO: 35
and the original sequence from which it has been varied;

Fig. 10 is a comparison between the amino acid sequence of SEQ ID NO: 36
and the original sequence from which it has been varied;

Fig. 11 is a comparison between the amino acid sequence of SEQ ID NO: 37
and the original sequence from which it has been varied;

Fig. 12 is a comparison between the amino acid sequence of SEQ ID NO: 38
and the original sequence from which it has been varied;

Fig. 13 is a comparison between the amino acid sequence of SEQ ID NO: 39
and the original sequence from which it has been varied;

Fig. 14 is a comparison between the amino acid sequence of SEQ ID NO: 40
and the original sequence from which it has been varied;

Fig. 15 is a comparison between the amino acid sequence of SEQ ID NO: 41
and the original sequence from which it has been varied;

Fig. 16 is a comparison between the amino acid sequence of SEQ ID NO: 42
and the original sequence from which it has been varied;

Fig. 17 is a comparison between the amino acid sequence of SEQ ID NO: 43
and the original sequence from which it has been varied;

Fig. 18 is a comparison between the amino acid sequence of SEQ ID NO: 44
and the original sequence from which it has been varied;

Fig. 19 is a comparison between the amino acid sequence of SEQ ID NO: 45
and the original sequence from which it has been varied;

Fig. 20 is a comparison between the amino acid sequence of SEQ ID NO: 46
and the original sequence from which it has been varied;

Fig. 21 is a comparison between the amino acid sequence of SEQ ID NO: 47
and the original sequence from which it has been varied;

Fig. 22 is a comparison between the amino acid sequence of SEQ ID NO: 48
and the original sequence from which it has been varied;

Fig. 23 is a comparison between the amino acid sequence of SEQ ID NO: 49
and the original sequence from which it has been varied;

Fig. 24 is a comparison between the amino acid sequence of SEQ ID NO: 50
and the original sequence from which it has been varied;

Fig. 25 is a comparison between the amino acid sequence of SEQ ID NO: 51
and the original sequence from which it has been varied;

Fig. 26 is a comparison between the amino acid sequence of SEQ ID NO: 52
and the original sequence from which it has been varied.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Example I: Comparison of variants with original sequences 

Original sequences were obtained from GenBank Version 110.

Comparison between the original sequences and the novel variant sequences was
made using the Pileup application from the GCG suite version 10.0 (January
1999), with the default values:

Gap creation penalty (GapWeight): 8
Gap extension penalty (GapLengthWeight): 2
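
To make the effect of the two default values concrete, the following Python
sketch scores a single gap under the common affine model, in which a gap is
charged the creation penalty once plus the extension penalty for each position
it spans. This form is an assumption made for illustration; the exact
convention applied by the Pileup application is not stated here, and the
function name is hypothetical.

```python
def affine_gap_penalty(gap_length: int, creation: int = 8, extension: int = 2) -> int:
    """Penalty for one gap: creation charged once, extension per gap position.

    Defaults mirror the GapWeight (8) and GapLengthWeight (2) values above;
    the affine formula itself is an illustrative assumption.
    """
    if gap_length < 0:
        raise ValueError("gap length cannot be negative")
    if gap_length == 0:
        return 0
    return creation + extension * gap_length


for length in (1, 3, 10):
    print(length, affine_gap_penalty(length))
```

Under this model one long gap is penalized less than several short gaps of the
same total length, which favors alignments in which an inserted or deleted
segment appears as a single contiguous block.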

The comparison is shown in Fig. 1 to 26 which show the comparison of 
each of the variant products depicted in SEQ ID NO: 27 to 52 with the original 
sequence from which it was varied. 

The following is a table that compares the sequences of the variants of the
invention to the original sequences from which they were varied and indicates where
the variant differs from the original sequence. The terminology NV-1 to NV-26
corresponds to SEQ ID NO: 1 to SEQ ID NO: 26.

TABLE

Accession #   New Variant #   Description of the new variant

BPI_HUMAN     NV-1            Replacement of the last C-terminal 92 aa of the
                              protein with alternative 100 aa, which has an
                              additional potential transmembrane domain.

BAL_HUMAN     NV-2            Replacement of 121 C-terminal amino acids of the
                              original protein by alternative 7 amino acids.

FA12_HUMAN    NV-3            Insertion of 30 amino acids in the catalytic
                              domain of the original protein.

TIM1_HUMAN    NV-4            Replacement of 52 C-terminal amino acids of the
                              original protein by alternative 9 amino acids.
                              Missing 3 disulfide bonds.

TIM1_HUMAN    NV-5            Replacement of 94 C-terminal amino acids of the
                              original protein by alternative 10 amino acids.
                              Has only 1 out of 5 disulfide bonds, has an extra
                              Cys in the added 10 amino acids.

TIM1_HUMAN    NV-6            Replacement of one of the exons of the original
                              protein by a homologous exon.

TIM1_HUMAN    NV-7            Deletion of 47 amino acids of the original
                              protein. Missing 2 disulfide bonds.

TIM1_HUMAN    NV-8            Deletion of 10 amino acids of the original
                              protein. Missing one out of 2 glycosylation sites.

UROK_HUMAN    NV-9            Insertion of 62 amino acids in long A chain.

PAI1_HUMAN    NV-10           Replacement of 140 C-terminal amino acids of the
                              original protein, including the active site and
                              2 out of 3 glycosylation sites, by alternative
                              12 amino acids.

PAI1_HUMAN    NV-11           Replacement of 17 C-terminal amino acids of the
                              original protein by alternative 14 amino acids.

CTGF_HUMAN    NV-12           Deletion of 32 amino acids, at positions 152-185
                              of the original protein.

DCC_HUMAN     NV-13           Replacement of 1118 C-terminal amino acids of the
                              original protein, including cytoplasmic domain,
                              transmembrane region and part of the
                              extracellular domain, by alternative 12 amino
                              acids. The deleted region contains all 5
                              fibronectin type domains and 1 out of the 4
                              Ig-like C2 type domains.

DCC_HUMAN     NV-14           Replacement of 28 C-terminal amino acids from the
                              cytoplasmic domain of the original protein by
                              alternative 13 amino acids.

MGR4_HUMAN    NV-15           Deletion of 58 amino acids from the extracellular
                              domain of the original protein.

MCR_HUMAN     NV-16           Replacement of 147 C-terminal amino acids,
                              including part of the steroid binding domain of
                              the original protein, by alternative 8 amino
                              acids.

MCR_HUMAN     NV-17           Deletion of 147 C-terminal amino acids, including
                              part of the steroid binding domain of the
                              original protein.

BAL_HUMAN     NV-18           Deletion of the last 102 aa at the C-terminus
                              domain.

UROK_HUMAN    NV-19           Replacement of 308 aa from the C-terminus by 78
                              different aa.

CTGF_HUMAN    NV-20           Deletion of the N-terminal 276 aa.

CTGF_HUMAN    NV-21           Replacement of the C-terminus from position 151
                              by 10 aa.

DCC_HUMAN     NV-22           Deletion of the C-terminus domain from aa 139.

TIM1_HUMAN    NV-23           From amino acid 103 of the original protein there
                              is a replacement of the original C-terminus by
                              44 amino acids.

TIM1_HUMAN    NV-24           Replacement of the first 67 amino acids of the
                              original protein by 13 different amino acids;
                              from amino acid 111 of the original protein there
                              is a replacement of the original C-terminus by
                              25 amino acids.

TIM1_HUMAN    NV-25           From amino acid 98 of the original protein there
                              is a replacement of the original 109 amino acids
                              of the C-terminus by 17 different amino acids.

TIM1_HUMAN    NV-26           Deletion of 65 amino acids of the N-terminus
                              domain of the original protein.


Example II: Designation of the original sequences

Each novel variant of the invention is varied from an original sequence
which has a known designation. The designation of the original sequence
from which each variant was varied and the Accession Number of the
original sequence are given below. First, information concerning the original
sequence is given and then designation of the novel variants of the invention is
given as NV-1 to NV-26 corresponding to SEQ ID NO: 1 to SEQ ID NO: 26.
Diseases which can be treated or detected by the variants of the invention are also
mentioned in the description of the original sequence.

BPI_HUMAN

BACTERICIDAL PERMEABILITY-INCREASING PROTEIN

FUNCTION : THE CYTOTOXIC ACTION OF BPI IS LIMITED TO MANY
SPECIES OF GRAM-NEGATIVE BACTERIA; THIS SPECIFICITY MAY BE
EXPLAINED BY A STRONG AFFINITY OF THE VERY BASIC
N-TERMINAL HALF FOR THE NEGATIVELY CHARGED
LIPOPOLYSACCHARIDES THAT ARE UNIQUE TO THE
GRAM-NEGATIVE BACTERIAL OUTER ENVELOPE.
SUBCELLULAR LOCATION : MEMBRANE-ASSOCIATED IN
POLYMORPHONUCLEAR LEUKOCYTES (PMN) GRANULES.
TISSUE SPECIFICITY : RESTRICTED TO CELLS OF THE MYELOID
SERIES.
DOMAIN : THE N-TERMINAL REGION MAY BE EXPOSED TO THE
INTERIOR OF THE GRANULE, WHEREAS THE C-TERMINAL PORTION
MAY BE EMBEDDED IN THE MEMBRANE. DURING PHAGOCYTOSIS
AND DEGRANULATION, PROTEASES MAY BE RELEASED AND
ACTIVATED AND CLEAVE BPI AT THE JUNCTION OF THE N- AND
C-TERMINAL PORTIONS OF THE MOLECULE, PROVIDING
CONTROLLED RELEASE OF THE N-TERMINAL ANTIBACTERIAL
FRAGMENT WHEN BACTERIA ARE INGESTED.

NV-1:



BAL_HUMAN

BILE-SALT-ACTIVATED LIPASE (BAL)
CHOLESTEROL ESTERASE

FUNCTION : CATALYZES FAT AND VITAMIN ABSORPTION. ACTS IN
CONCERT WITH PANCREATIC LIPASE AND COLIPASE FOR THE
COMPLETE DIGESTION OF DIETARY TRIGLYCERIDES.
CATALYTIC ACTIVITY : TRIACYLGLYCEROL + H(2)O =
DIACYLGLYCEROL + A FATTY ACID ANION.
CATALYTIC ACTIVITY : A STERYL ESTER + H(2)O = A STEROL + A
FATTY ACID.
ENZYME REGULATION : ACTIVATED BY BILE SALTS CONTAINING A
7-HYDROXYL GROUP.
TISSUE SPECIFICITY : MAMMARY GLAND AND PANCREAS.
SIMILARITY : BELONGS TO THE TYPE-B CARBOXYLESTERASE/LIPASE
FAMILY.

NV-2: NV-18

FA12_HUMAN

COAGULATION FACTOR XII

FUNCTION : FACTOR XII IS A SERUM GLYCOPROTEIN THAT
PARTICIPATES IN THE INITIATION OF BLOOD COAGULATION,
FIBRINOLYSIS, AND THE GENERATION OF BRADYKININ AND
ANGIOTENSIN.
CATALYTIC ACTIVITY : CLEAVES SELECTIVELY ARG-|-ILE BONDS
AND ACTIVATES COAGULATION FACTORS VII AND XI.
PTM : O- AND N-GLYCOSYLATED.
DISEASE : DEFECTS IN F12 DO NOT CAUSE ANY CLINICAL
SYMPTOMS. THE SOLE EFFECT IS THAT WHOLE-BLOOD CLOTTING
TIME IS PROLONGED.
MISCELLANEOUS : FACTOR XII, PREKALLIKREIN, AND HMW
KININOGEN FORM A COMPLEX BOUND TO AN ANIONIC SURFACE.
PREKALLIKREIN IS CLEAVED BY FACTOR XII TO FORM KALLIKREIN,
WHICH THEN CLEAVES FACTOR XII FIRST TO ALPHA-FACTOR XIIA
AND THEN TO BETA-FACTOR XIIA. ALPHA-FACTOR XIIA ACTIVATES
FACTOR XI TO FACTOR XIA.

NV-3:



TIM1_HUMAN

METALLOPROTEINASE INHIBITOR 1

FUNCTION : COMPLEXES WITH METALLOPROTEINASES (SUCH AS
COLLAGENASES) AND IRREVERSIBLY INACTIVATES THEM. ALSO
MEDIATES ERYTHROPOIESIS IN VITRO; BUT, UNLIKE IL-3, IT IS
SPECIES-SPECIFIC, STIMULATING THE GROWTH AND
DIFFERENTIATION OF ONLY HUMAN AND MURINE ERYTHROID
PROGENITORS.
PTM : THE ACTIVITY OF TIMP-1 IS DEPENDENT ON THE PRESENCE OF
DISULFIDE BONDS.
SIMILARITY : BELONGS TO THE TIMP FAMILY.



NV-4: NV-23, NV-24, NV-25, NV-26

TIM1_HUMAN

METALLOPROTEINASE INHIBITOR 1

NV-5:

TIM1_HUMAN

METALLOPROTEINASE INHIBITOR 1

NV-6:

TIM1_HUMAN

METALLOPROTEINASE INHIBITOR 1

NV-7:

TIM1_HUMAN

METALLOPROTEINASE INHIBITOR 1

NV-8:

UROK_HUMAN

UROKINASE-TYPE PLASMINOGEN ACTIVATOR

FUNCTION : POTENT PLASMINOGEN ACTIVATOR AND IS CLINICALLY
USED FOR THERAPY OF THROMBOLYTIC DISORDERS.
CATALYTIC ACTIVITY : SPECIFIC CLEAVAGE OF ARG-|-VAL BOND IN
PLASMINOGEN TO FORM PLASMIN.
SUBUNIT : FOUND IN HIGH AND LOW MOLECULAR MASS FORMS.
EACH CONSISTS OF TWO CHAINS, A AND B. THE HIGH MOLECULAR
MASS FORM CONTAINS A LONG CHAIN A. CLEAVAGE OCCURS
AFTER RESIDUE 155 IN THE LOW MOLECULAR MASS FORM TO YIELD
A SHORT A1 CHAIN.
PHARMACEUTICAL : AVAILABLE UNDER THE NAME ABBOKINASE
(ABBOTT). USED IN PULMONARY EMBOLISM (PE) TO INITIATE
FIBRINOLYSIS.

NV-9: NV-19



PAI1_HUMAN

PLASMINOGEN ACTIVATOR INHIBITOR-1, ENDOTHELIAL

FUNCTION : THIS INHIBITOR ACTS AS "BAIT" FOR TISSUE
PLASMINOGEN ACTIVATOR, UROKINASE, AND PROTEIN C. ITS RAPID
INTERACTION WITH TPA MAY FUNCTION AS A MAJOR CONTROL
POINT IN THE REGULATION OF FIBRINOLYSIS.
DISEASE : HIGH CONCENTRATIONS OF THIS PROTEIN HAVE BEEN
ASSOCIATED WITH HUMAN THROMBOEMBOLIC DISEASE.
MISCELLANEOUS : PAI-1 IS INACTIVATED BY PROTEOLYTIC ATTACK
OF THE UROKINASE-TYPE (U-PA) AND THE TISSUE-TYPE (TPA),
CLEAVING THE 369(R)-370(M) BOND.

NV-10:

PAI1_HUMAN

PLASMINOGEN ACTIVATOR INHIBITOR-1, ENDOTHELIAL

NV-11:

CTGF_HUMAN

CONNECTIVE TISSUE GROWTH FACTOR

FUNCTION : MAJOR CONNECTIVE TISSUE MITOATTRACTANT
SECRETED BY HUMAN VASCULAR ENDOTHELIAL CELLS. THIS
IMMEDIATE-EARLY PROTEIN MAY BIND ONE OF THE PDGF CELL
SURFACE RECEPTORS.
SUBUNIT : MONOMER.
ALTERNATIVE PRODUCTS : A SHORTER FORM MAY BE PRODUCED
BY ALTERNATIVE SPLICING OF THE SAME GENE: VARSPLIC 172
198 MISSING (IN SHORT FORM).
SIMILARITY : BELONGS TO THE INSULIN-LIKE GROWTH FACTOR
BINDING PROTEIN FAMILY. CEF-10/CYR61/CTGF/FISP-12/NOV

NV-12: NV-20, NV-21

DCC_HUMAN

TUMOR SUPPRESSOR PROTEIN DCC

FUNCTION : IMPLICATED AS A TUMOR SUPPRESSOR GENE.
SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN.
TISSUE SPECIFICITY : FOUND IN AXONS OF THE CENTRAL AND
PERIPHERAL NERVOUS SYSTEM AND IN DIFFERENTIATED CELL
TYPES OF THE INTESTINE.
DISEASE : COLORECTAL TUMORS THAT LOST THEIR CAPACITY TO
DIFFERENTIATE INTO MUCUS PRODUCING CELLS UNIFORMLY LACK
DCC EXPRESSION. INACTIVATION OF DCC DUE TO ALLELIC
DELETION AND/OR POINT MUTATIONS MAY CAUSE BOTH
LYMPHATIC AND HEMATOGENOUS METASTASIS OF OESOPHAGEAL
SQUAMOUS CELL CARCINOMAS.
SIMILARITY : CONTAINS 4 IMMUNOGLOBULIN-LIKE C2-TYPE
DOMAINS.
SIMILARITY : CONTAINS 6 FIBRONECTIN TYPE III-LIKE DOMAINS.

NV-13: NV-22

DCC_HUMAN

TUMOR SUPPRESSOR PROTEIN DCC

NV-14:

MGR4_HUMAN

METABOTROPIC GLUTAMATE RECEPTOR 4

FUNCTION : RECEPTOR FOR GLUTAMATE. THE ACTIVITY OF THIS
RECEPTOR IS MEDIATED BY A G-PROTEIN THAT INHIBITS
ADENYLATE CYCLASE ACTIVITY.
SUBCELLULAR LOCATION : INTEGRAL MEMBRANE PROTEIN.
TISSUE SPECIFICITY : STRONGLY EXPRESSED IN THE CEREBELLUM.
EXPRESSED AT LOW LEVELS IN HIPPOCAMPUS, HYPOTHALAMUS
AND THALAMUS. NO EXPRESSION DETECTED IN LIVER.
SIMILARITY : BELONGS TO FAMILY 3 OF G-PROTEIN COUPLED
RECEPTORS, STRONGEST, TO MGLUR6.

NV-15:



MCR_HUMAN

MINERALOCORTICOID RECEPTOR

FUNCTION : RECEPTOR FOR BOTH MINERALOCORTICOIDS (MC) SUCH
AS ALDOSTERONE AND GLUCOCORTICOIDS (GC) SUCH AS
CORTICOSTERONE OR CORTISOL. THE EFFECT OF MC IS TO
INCREASE ION AND WATER TRANSPORT AND THUS RAISE
EXTRACELLULAR FLUID VOLUME AND BLOOD PRESSURE AND
LOWER POTASSIUM LEVELS.
SUBCELLULAR LOCATION : NUCLEAR.
DOMAIN : COMPOSED OF THREE DOMAINS: A MODULATING
N-TERMINAL DOMAIN, A DNA-BINDING DOMAIN AND A
C-TERMINAL STEROID-BINDING DOMAIN.
SIMILARITY : BELONGS TO THE NUCLEAR HORMONE RECEPTORS
FAMILY. NR3 SUBFAMILY.

NV-16:

MCR_HUMAN

MINERALOCORTICOID RECEPTOR

NV-17:

Example III: Variant nucleic acid sequence

The nucleic acid sequences of the invention include nucleic acid
sequences which encode variant product and fragments and analogs thereof. The
nucleic acid sequences may alternatively be sequences complementary to the
above coding sequence, or to a region of said coding sequence. The length of the
complementary sequence is sufficient to avoid the expression of the coding
sequence. The nucleic acid sequences may be in the form of RNA or in the form
of DNA, and include messenger RNA, synthetic RNA and DNA, cDNA, and
genomic DNA. The DNA may be double-stranded or single-stranded, and if
single-stranded may be the coding strand or the non-coding (anti-sense,
complementary) strand. The nucleic acid sequences may also include both
dNTPs and rNTPs, as well as non-naturally occurring sequences. The sequence may
also be a part of a hybrid between an amino acid sequence and a nucleic acid
sequence.

In a general embodiment, the nucleic acid sequence has at least 90%
identity, preferably 95% identity, with any one of the sequences identified as SEQ
ID NO: 1 to SEQ ID NO: 26, provided that this sequence is not completely
identical with that of the original sequence.
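
As a non-authoritative sketch of the identity criterion and its proviso, the
following Python fragment assumes that the candidate and the reference have
already been aligned to equal length (with "-" marking gap positions) and
counts identity over all alignment columns. This way of computing identity is
an assumption made for illustration, as are the function names and the short
example sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns holding the same non-gap residue."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def meets_general_embodiment(candidate: str, reference: str, original: str,
                             threshold: float = 90.0) -> bool:
    """At least `threshold` % identity to the reference, and not the original."""
    return percent_identity(candidate, reference) >= threshold and candidate != original


# Hypothetical ten-base sequences, for illustration only.
print(percent_identity("ACGTACGTAA", "ACGTACGTAC"))  # 90.0
```

Passing threshold=95.0 corresponds to the preferred level of identity.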

The nucleic acid sequences may include the coding sequence by itself. By
another alternative the coding region may be in combination with additional
coding sequences, such as those coding for fusion protein or signal peptides, in
combination with non-coding sequences, such as introns and control elements,
promoter and terminator elements or 5' and/or 3' untranslated regions, effective
for expression of the coding sequence in a suitable host, and/or in a vector or host
environment in which the variant nucleic acid sequence is introduced as a
heterologous sequence.

The nucleic acid sequences of the present invention may also have the
product coding sequence fused in-frame to a marker sequence which allows for
purification of the variant product. The marker sequence may be, for example, a
hexahistidine tag to provide for purification of the mature polypeptide fused to
the marker in the case of a bacterial host, or the marker sequence may be a
hemagglutinin (HA) tag when a mammalian host, e.g. COS-7 cells, is used. The
HA tag corresponds to an epitope derived from the influenza hemagglutinin
protein (Wilson, I., et al., Cell 37:767 (1984)).

Also included in the scope of the invention are fragments as defined above,
also referred to herein as oligonucleotides, typically having at least 20 bases,
preferably 20-30 bases, corresponding to a region of the coding-sequence nucleic
acid sequence. The fragments may be used as probes, primers, and when
complementary also as antisense agents, and the like, according to known
methods.
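
Purely as an illustration of how candidate fragments of the stated length might
be enumerated from a coding sequence, the following Python sketch lists every
contiguous window of a chosen length. The function name and the placeholder
sequence are hypothetical; selecting which fragments are actually useful as
probes or primers would require further criteria not modeled here.

```python
def oligo_fragments(sequence: str, length: int = 20) -> list:
    """Return every contiguous fragment of the given length, in 5'->3' order."""
    if length < 1:
        raise ValueError("length must be at least 1")
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Placeholder 25-base sequence, not taken from the sequence listing.
example = "ATGGCCAAGTTCGACCTGAAGGCTA"
fragments = oligo_fragments(example, 20)
print(len(fragments), fragments[0])
```

A sequence shorter than the requested length simply yields an empty list.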

As indicated above, the nucleic acid sequence may be substantially as
depicted in any one of SEQ ID NO: 1 to SEQ ID NO: 26 or fragments thereof or
sequences having at least 90% identity, preferably 95% identity, to the above
sequence as explained above. Alternatively, due to the degenerate nature of the
genetic code, the sequence may be a sequence coding for any one of the amino
acid sequences of SEQ ID NO: 27 to SEQ ID NO: 52, or fragments or analogs of
said amino acid sequence.
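
The consequence of the degeneracy of the genetic code can be illustrated by
counting how many distinct DNA sequences encode a given peptide. The Python
sketch below builds the standard genetic code and multiplies the number of
synonymous codons at each position; the function names and the three-residue
peptide are hypothetical examples.

```python
from itertools import product
from math import prod

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def codons_for(amino_acid: str) -> list:
    """All codons that encode the given one-letter amino acid."""
    return [codon for codon, aa in CODON_TABLE.items() if aa == amino_acid]


def count_coding_sequences(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the peptide."""
    return prod(len(codons_for(aa)) for aa in peptide)


print(codons_for("M"))                # ['ATG']
print(count_coding_sequences("MLS"))  # 1 * 6 * 6 = 36
```

Even a short peptide is therefore encoded by many nucleotide sequences, and
the count grows multiplicatively with peptide length.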

A. Preparation of nucleic acid sequences

The nucleic acid sequences may be obtained by screening cDNA libraries
using oligonucleotide probes which can hybridize to or PCR-amplify nucleic acid
sequences which encode the variant products disclosed above. cDNA libraries
prepared from a variety of tissues are commercially available and procedures for
screening and isolating cDNA clones are well-known to those of skill in the art.
Such techniques are described in, for example, Sambrook et al. (1989) Molecular
Cloning: A Laboratory Manual (2nd Edition), Cold Spring Harbor Press,
Plainview, NY, and Ausubel FM et al. (1989) Current Protocols in Molecular
Biology, John Wiley & Sons, New York, NY.

The nucleic acid sequences may be extended to obtain upstream and
downstream sequences such as promoters, regulatory elements, and 5' and 3'
untranslated regions (UTRs). Extension of the available transcript sequence may
be performed by numerous methods known to those of skill in the art, such as
PCR or primer extension (Sambrook et al., supra), or by the RACE method
using, for example, the Marathon RACE kit (Clontech, Cat. # K1802-1).

Alternatively, the technique of "restriction-site" PCR (Gobinda et al., PCR
Methods Applic. 2:318-22 (1993)), which uses universal primers to retrieve
flanking sequence adjacent to a known locus, may be employed. First, genomic
DNA is amplified in the presence of a primer to a linker sequence and a primer
specific to the known region. The amplified sequences are subjected to a second
round of PCR with the same linker primer and another specific primer internal to
the first one. Products of each round of PCR are transcribed with an appropriate
RNA polymerase and sequenced using reverse transcriptase.

Inverse PCR can be used to amplify or extend sequences using divergent
primers based on a known region (Triglia, T. et al., Nucleic Acids Res. 16:8186
(1988)). The primers may be designed using OLIGO(R) 4.06 Primer Analysis
Software (1992; National Biosciences Inc., Plymouth, Minn.), or another
appropriate program, to be 22-30 nucleotides in length, to have a GC content of
50% or more, and to anneal to the target sequence at temperatures about 68-72°C.
The method uses several restriction enzymes to generate a suitable fragment in
the known region of a gene. The fragment is then circularized by intramolecular
ligation and used as a PCR template.
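
The length and GC-content criteria stated above lend themselves to a simple
check. The following Python sketch is not a substitute for primer analysis
software and deliberately does not estimate the annealing temperature, since
that would require choosing a melting-temperature model not specified here.
The function names and the test primers are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in the primer."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def meets_primer_criteria(primer: str, min_len: int = 22, max_len: int = 30,
                          min_gc: float = 0.50) -> bool:
    """Check the stated length (22-30 nt) and GC content (50% or more)."""
    if not primer:
        return False
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


# Synthetic test primers, for illustration only.
print(meets_primer_criteria("GC" * 8 + "AT" * 4))  # True: 24 nt, GC fraction 2/3
print(meets_primer_criteria("AT" * 12))            # False: 24 nt, no G or C
```

A primer passing this check would still need to be evaluated for annealing
temperature, self-complementarity and specificity.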

Capture PCR (Lagerstrom, M. et al., PCR Methods Applic. 1:111-19
(1991)) is a method for PCR amplification of DNA fragments adjacent to a
known sequence in human and yeast artificial chromosome DNA. Capture PCR
also requires multiple restriction enzyme digestions and ligations to place an
engineered double-stranded sequence into a flanking part of the DNA molecule
before PCR.

Another method which may be used to retrieve flanking sequences is that
of Parker, J.D. et al. (Nucleic Acids Res. 19:3055-60 (1991)). Additionally, one
can use PCR, nested primers and PromoterFinder™ libraries to "walk in" genomic
DNA (PromoterFinder™; Clontech, Palo Alto, CA). This process avoids the need
to screen libraries and is useful in finding intron/exon junctions. Preferred
libraries for screening for full length cDNAs are ones that have been size-selected
to include larger cDNAs. Also, random primed libraries are preferred in that they
will contain more sequences which contain the 5' and upstream regions of genes.
A randomly primed library may be particularly useful if an oligo d(T)
library does not yield a full-length cDNA. Genomic libraries are useful for
extension into the 5' nontranslated regulatory region.

The nucleic acid sequences and oligonucleotides of the invention can also
be prepared by solid-phase methods, according to known synthetic methods.
Typically, fragments of up to about 100 bases are individually synthesized, then
joined to form continuous sequences up to several hundred bases.

B. Use of variant nucleic acid sequence for the production of
variant products

In accordance with the present invention, nucleic acid sequences specified
above may be used as recombinant DNA molecules that direct the expression of
variant products.

As will be understood by those of skill in the art, it may be advantageous
to produce variant product-encoding nucleotide sequences possessing codons
other than those which appear in any one of SEQ ID NO: 1 to SEQ ID NO: 26,
which are those that occur naturally in the human genome. Codons preferred
by a particular prokaryotic or eukaryotic host (Murray, E. et al., Nuc. Acids Res.,
17:477-508, (1989)) can be selected, for example, to increase the rate of variant
product expression or to produce recombinant RNA transcripts having desirable
properties, such as a longer half-life, than transcripts produced from naturally
occurring sequence.
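
By way of illustration only, the substitution of host-preferred codons without altering the encoded product can be sketched as follows. The preferred-codon table and the short coding sequence are hypothetical placeholders; in practice the table would be derived from codon usage data for the chosen host.

```python
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Hypothetical host preferences: one synonymous codon per amino acid.
PREFERRED = {"L": "CTG", "K": "AAA", "S": "AGC"}


def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    if len(dna) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna, preferred):
    """Replace each codon by the host-preferred synonymous codon, if any."""
    for amino_acid, codon in preferred.items():
        if CODON_TABLE[codon] != amino_acid:
            raise ValueError(f"{codon} does not encode {amino_acid}")
    if len(dna) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    codons = (dna[i:i + 3] for i in range(0, len(dna), 3))
    # Codons for amino acids without a stated preference are left unchanged.
    return "".join(preferred.get(CODON_TABLE[c], c) for c in codons)


ORIGINAL = "ATGTTAAAGTCTTAA"          # encodes M L K S stop
RECODED = recode(ORIGINAL, PREFERRED)
```

The recoded sequence differs at the nucleotide level but translates to the same amino acid sequence, which is the property the paragraph above relies on.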

The nucleic acid sequences of the present invention can be engineered in 
order to alter a variant product coding sequence for a variety of reasons, 
including, but not limited to, alterations which modify the cloning, processing
and/or expression of the product. For example, alterations may be introduced 
using techniques which are well known in the art, e.g., site-directed mutagenesis, 
to insert new restriction sites, to alter glycosylation patterns, to change codon 
preference, etc. 

The present invention also includes recombinant constructs comprising
one or more of the sequences as broadly described above. The constructs
comprise a vector, such as a plasmid or viral vector, into which a nucleic acid
sequence of the invention has been inserted, in a forward or reverse orientation.
In a preferred aspect of this embodiment, the construct further comprises
regulatory sequences, including, for example, a promoter, operably linked to the
sequence. Large numbers of suitable vectors and promoters are known to those
of skill in the art, and are commercially available. Appropriate cloning and
expression vectors for use with prokaryotic and eukaryotic hosts are also
described in Sambrook, et al. (supra).

The present invention also relates to host cells which are genetically
engineered with vectors of the invention, and the production of the product of the
invention by recombinant techniques. Host cells are genetically engineered (i.e.,
transduced, transformed or transfected) with the vectors of this invention which
may be, for example, a cloning vector or an expression vector. The vector may
be, for example, in the form of a plasmid, a viral particle, a phage, etc. The
engineered host cells can be cultured in conventional nutrient media modified as
appropriate for activating promoters, selecting transformants or amplifying the
expression of the variant nucleic acid sequence. The culture conditions, such as
temperature, pH and the like, are those previously used with the host cell selected
for expression, and will be apparent to those skilled in the art.

The nucleic acid sequences of the present invention may be included in
any one of a variety of expression vectors for expressing a product. Such vectors
include chromosomal, nonchromosomal and synthetic DNA sequences, e.g.,
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast
plasmids; vectors derived from combinations of plasmids and phage DNA; viral
DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However,
any other vector may be used as long as it is replicable and viable in the host.
The appropriate DNA sequence may be inserted into the vector by a variety of
procedures. In general, the DNA sequence is inserted into an appropriate
restriction endonuclease site(s) by procedures known in the art. Such procedures
and related sub-cloning procedures are deemed to be within the scope of those
skilled in the art.

The DNA sequence in the expression vector is operatively linked to an
appropriate transcription control sequence (promoter) to direct mRNA synthesis.
Examples of such promoters include: LTR or SV40 promoter, the E. coli lac or
trp promoter, the phage lambda PL promoter, and other promoters known to
control expression of genes in prokaryotic or eukaryotic cells or their viruses.
The expression vector also contains a ribosome binding site for translation
initiation, and a transcription terminator. The vector may also include
appropriate sequences for amplifying expression. In addition, the expression
vectors preferably contain one or more selectable marker genes to provide a
phenotypic trait for selection of transformed host cells such as dihydrofolate
reductase or neomycin resistance for eukaryotic cell culture, or such as
tetracycline or ampicillin resistance in E. coli.

The vector containing the appropriate DNA sequence as described above,
as well as an appropriate promoter or control sequence, may be employed to
transform an appropriate host to permit the host to express the protein. Examples
of appropriate expression hosts include: bacterial cells, such as E. coli,
Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells
such as Drosophila and Spodoptera Sf9; animal cells such as CHO, COS, HEK
293 or Bowes melanoma; adenoviruses; plant cells, etc. The selection of an
appropriate host is deemed to be within the scope of those skilled in the art from
the teachings herein. The invention is not limited by the host cells employed.

In bacterial systems, a number of expression vectors may be selected
depending upon the use intended for the variant product. For example, when
large quantities of variant product are needed for the induction of antibodies,
vectors which direct high level expression of fusion proteins that are readily
purified may be desirable. Such vectors include, but are not limited to,
multifunctional E. coli cloning and expression vectors such as Bluescript(R)
(Stratagene), in which the variant polypeptide coding sequence may be ligated
into the vector in-frame with sequences for the amino-terminal Met and the
subsequent 7 residues of beta-galactosidase so that a hybrid protein is produced;
pIN vectors (Van Heeke & Schuster, J. Biol. Chem. 264:5503-5509, (1989)); pET
vectors (Novagen, Madison WI); and the like.

In the yeast Saccharomyces cerevisiae, a number of vectors containing
constitutive or inducible promoters such as alpha factor, alcohol oxidase and
PGH may be used. For reviews, see Ausubel et al. (supra) and Grant et al.
(Methods in Enzymology 153:516-544, (1987)).

In cases where plant expression vectors are used, the expression of a
sequence encoding variant product may be driven by any of a number of
promoters. For example, viral promoters such as the 35S and 19S promoters of
CaMV (Brisson et al., Nature 310:511-514, (1984)) may be used alone or in
combination with the omega leader sequence from TMV (Takamatsu et al.,
EMBO J., 6:307-311, (1987)). Alternatively, plant promoters such as the small
subunit of RUBISCO (Coruzzi et al., EMBO J. 3:1671-1680, (1984); Broglie et
al., Science 224:838-843, (1984)); or heat shock promoters (Winter J. and
Sinibaldi R.M., Results Probl. Cell Differ., 17:85-105, (1991)) may be used.
These constructs can be introduced into plant cells by direct DNA transformation
or pathogen-mediated transfection. For reviews of such techniques, see Hobbs S.
or Murry L.E. (1992) in McGraw Hill Yearbook of Science and Technology,
McGraw Hill, New York, N.Y., pp 191-196; or Weissbach and Weissbach (1988)
Methods for Plant Molecular Biology, Academic Press, New York, N.Y., pp
421-463.

Variant product may also be expressed in an insect system. In one such
system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a
vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia
larvae. The variant product coding sequence may be cloned into a nonessential
region of the virus, such as the polyhedrin gene, and placed under control of the
polyhedrin promoter. Successful insertion of variant coding sequence will render
the polyhedrin gene inactive and produce recombinant virus lacking coat
protein. The recombinant viruses are then used to infect S. frugiperda cells or
Trichoplusia larvae in which variant protein is expressed (Smith et al., J. Virol.
46:584, (1983); Engelhard, E.K. et al., Proc. Natl. Acad. Sci. 91:3224-7, (1994)).

In mammalian host cells, a number of viral-based expression systems may
be utilized. In cases where an adenovirus is used as an expression vector, a
variant product coding sequence may be ligated into an adenovirus
transcription/translation complex consisting of the late promoter and tripartite
leader sequence. Insertion in a nonessential E1 or E3 region of the viral genome
will result in a viable virus capable of expressing variant protein in infected host
cells (Logan and Shenk, Proc. Natl. Acad. Sci. 81:3655-59, (1984)). In addition,
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be
used to increase expression in mammalian host cells.

Specific initiation signals may also be required for efficient translation of
a variant product coding sequence. These signals include the ATG initiation
codon and adjacent sequences. In cases where variant product coding sequence,
its initiation codon and upstream sequences are inserted into the appropriate
expression vector, no additional translational control signals may be needed.
However, in cases where only coding sequence, or a portion thereof, is inserted,
exogenous translational control signals including the ATG initiation codon
must be provided. Furthermore, the initiation codon must be in the correct
reading frame to ensure translation of the entire insert. Exogenous
translational elements and initiation codons can be of various origins, both
natural and synthetic. The efficiency of expression may be enhanced by the
inclusion of enhancers appropriate to the cell system in use (Scharf, D. et al.,
Results Probl. Cell Differ., 20:125-62, (1994); Bittner et al., Methods in
Enzymol. 153:516-544, (1987)).
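
The requirement that the initiation codon be present and in the correct reading frame can be checked in silico before cloning. The following is a minimal sketch under simplifying assumptions: the function name and test sequences are hypothetical, the insert is assumed to be supplied as a DNA string beginning at the intended initiation codon, and adjacent context sequences are not evaluated.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_insert(insert):
    """Return a list of reading-frame problems found in a coding insert."""
    seq = insert.upper()
    problems = []
    if not seq.startswith("ATG"):
        problems.append("no ATG initiation codon at the 5' end")
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    # Read complete codons in the frame set by the first base.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    # A stop codon before the last codon would truncate translation.
    for index, codon in enumerate(codons[:-1]):
        if codon in STOP_CODONS:
            problems.append(f"premature stop codon {codon} at codon {index}")
    return problems


# Hypothetical inserts used only to exercise the checks.
GOOD = check_insert("ATGGCCTAA")
TRUNCATED = check_insert("ATGTAAGCCTAA")
NO_START = check_insert("GCCATGGCC")
```

An empty list indicates that the insert begins with ATG and can be read through to its final codon in a single frame.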

In a further embodiment, the present invention relates to host cells
containing the above-described constructs. The host cell can be a higher
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
Introduction of the construct into the host cell can be effected by calcium
phosphate transfection, DEAE-Dextran mediated transfection, or electroporation
(Davis, L., Dibner, M., and Battey, I. (1986) Basic Methods in Molecular
Biology). Cell-free translation systems can also be employed to produce
polypeptides using RNAs derived from the DNA constructs of the present
invention.

A host cell strain may be chosen for its ability to modulate the expression
of the inserted sequences or to process the expressed protein in the desired
fashion. Such modifications of the protein include, but are not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and
acylation. Post-translational processing which cleaves a "pre-pro" form of the
protein may also be important for correct insertion, folding and/or function.
Different host cells such as CHO, HeLa, MDCK, 293, WI38, etc., have specific
cellular machinery and characteristic mechanisms for such post-translational
activities and may be chosen to ensure the correct modification and processing of
the introduced, foreign protein.

For long-term, high-yield production of recombinant proteins, stable
expression is preferred. For example, cell lines which stably express variant
product may be transformed using expression vectors which contain viral origins
of replication or endogenous expression elements and a selectable marker gene.
Following the introduction of the vector, cells may be allowed to grow for 1-2
days in an enriched medium before they are switched to selective media. The
purpose of the selectable marker is to confer resistance to selection, and its
presence allows growth and recovery of cells which successfully express the
introduced sequences. Resistant clumps of stably transformed cells can be
proliferated using tissue culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell
lines. These include, but are not limited to, the herpes simplex virus thymidine
kinase (Wigler, M. et al., Cell 11:223-32, (1977)) and adenine
phosphoribosyltransferase (Lowy, I. et al., Cell 22:817-23, (1980)) genes which
can be employed in tk- or aprt- cells, respectively. Also, antimetabolite,
antibiotic or herbicide resistance can be used as the basis for selection; for
example, dhfr, which confers resistance to methotrexate (Wigler, M. et al., Proc.
Natl. Acad. Sci. 77:3567-70, (1980)); npt, which confers resistance to the
aminoglycosides neomycin and G-418 (Colbere-Garapin, F. et al., J. Mol. Biol.,
150:1-14, (1981)); and als or pat, which confer resistance to chlorsulfuron and
phosphinotricin acetyltransferase, respectively (Murry, supra). Additional
selectable genes have been described, for example, trpB, which allows cells to
utilize indole in place of tryptophan, or hisD, which allows cells to utilize histidinol
in place of histidine (Hartman, S.C. and R.C. Mulligan, Proc. Natl. Acad. Sci.
85:8047-51, (1988)). The use of visible markers has gained popularity with such
markers as anthocyanins, beta-glucuronidase and its substrate, GUS, and
luciferase and its substrates, luciferin and ATP, being widely used not only to
identify transformants, but also to quantify the amount of transient or stable
protein expression attributable to a specific vector system (Rhodes, C.A. et al.,
Methods Mol. Biol., 55:121-131, (1995)).

Host cells transformed with a nucleotide sequence encoding variant
product may be cultured under conditions suitable for the expression and
recovery of the encoded protein from cell culture. The product produced by a
recombinant cell may be secreted or contained intracellularly depending on the
sequence and/or the vector used. As will be understood by those of skill in the
art, expression vectors containing nucleic acid sequences encoding variant
product can be designed with signal sequences which direct secretion of variant
product through a prokaryotic or eukaryotic cell membrane.

The variant product may also be expressed as a recombinant protein with
one or more additional polypeptide domains added to facilitate protein
purification. Such purification facilitating domains include, but are not limited
to, metal chelating peptides such as histidine-tryptophan modules that allow
purification on immobilized metals, protein A domains that allow purification on
immobilized immunoglobulin, and the domain utilized in the FLAGS
extension/affinity purification system (Immunex Corp., Seattle, Wash.). The
inclusion of a protease-cleavable polypeptide linker sequence between the
purification domain and variant product is useful to facilitate purification. One
such expression vector provides for expression of a fusion protein comprising
a variant polypeptide fused to a polyhistidine region separated by an enterokinase
cleavage site. The histidine residues facilitate purification on IMAC
(immobilized metal ion affinity chromatography, as described in Porath et al.,
Protein Expression and Purification, 3:263-281, (1992)) while the enterokinase
cleavage site provides a means for isolating variant polypeptide from the fusion
protein. pGEX vectors (Promega, Madison, Wis.) may also be used to express
foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In
general, such fusion proteins are soluble and can easily be purified from lysed
cells by adsorption to ligand-agarose beads (e.g., glutathione-agarose in the case
of GST-fusions) followed by elution in the presence of free ligand.

Following transformation of a suitable host strain and growth of the host
strain to an appropriate cell density, the selected promoter is induced by
appropriate means (e.g., temperature shift or chemical induction) and cells are
cultured for an additional period. Cells are typically harvested by centrifugation,
disrupted by physical or chemical means, and the resulting crude extract retained
for further purification. Microbial cells employed in expression of proteins can
be disrupted by any convenient method, including freeze-thaw cycling,
sonication, mechanical disruption, or use of cell lysing agents, or other methods,
which are well known to those skilled in the art.

The variant products can be recovered and purified from recombinant cell
cultures by any of a number of methods well known in the art, including
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation
exchange chromatography, phosphocellulose chromatography, hydrophobic
interaction chromatography, affinity chromatography, hydroxylapatite
chromatography, and lectin chromatography. Protein refolding steps can be used,
as necessary, in completing configuration of the mature protein. Finally, high
performance liquid chromatography (HPLC) can be employed for final
purification steps.

C. Diagnostic applications utilizing nucleic acid sequences 

The nucleic acid sequences of the present invention may be used for a
variety of diagnostic purposes. The nucleic acid sequences may be used to detect
and quantitate expression of the variant in a patient's cells, e.g. biopsied tissues, by
detecting the presence of mRNA coding for variant product. Alternatively, the
assay may be used to detect soluble variant in the serum or blood. This assay
typically involves obtaining total mRNA from the tissue or serum and contacting
the mRNA with a nucleic acid probe. The probe is a nucleic acid molecule of at
least 20 nucleotides, preferably 20-30 nucleotides, capable of specifically
hybridizing with a sequence included within the sequence of a nucleic acid
molecule encoding variant product under hybridizing conditions. The
presence of mRNA hybridized to the probe is then detected, thereby detecting
the expression of variant. This assay can be used to distinguish between absence,
presence, and excess expression of variant product and to monitor levels of
variant expression during therapeutic intervention. In addition, the assay may be
used to compare the levels of the variant of the invention to the levels of the
original sequence from which it has been varied or to levels of other variants,
which comparison may have some physiological meaning.

The invention also contemplates the use of the nucleic acid sequences as a
diagnostic for diseases resulting from inherited defective variant sequences, or
diseases in which the ratio of the amount of the original sequence from which the
variant was varied to the novel variants of the invention is altered. These
sequences can be detected by comparing the sequences of the defective (i.e.,
mutant) variant coding region with that of a normal coding region. Association
of the sequence coding for mutant variant product with abnormal variant product
activity may be verified. In addition, sequences encoding mutant variant products
can be inserted into a suitable vector for expression in a functional assay system
(e.g., colorimetric assay, complementation experiments in a variant protein
deficient strain of HEK293 cells) as yet another means to verify or identify
mutations. Once mutant genes have been identified, one can then screen
populations of interest for carriers of the mutant gene.

Individuals carrying mutations in the nucleic acid sequence of the present
invention may be detected at the DNA level by a variety of techniques. Nucleic
acids used for diagnosis may be obtained from a patient's cells, including, but not
limited to, cells from blood, urine, saliva, placenta, tissue biopsy and autopsy
material. Genomic DNA may be used directly for detection or may be amplified
enzymatically by using PCR (Saiki et al., Nature 324:163-166, (1986)) prior to
analysis. RNA or cDNA may also be used for the same purpose. As an example,
PCR primers complementary to the nucleic acid of the present invention can be
used to identify and analyze mutations in the gene of the present invention.
Deletions and insertions can be detected by a change in size of the amplified
product in comparison to the normal genotype.
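
The size comparison described above can be sketched computationally. The example below is illustrative only: the primer and template sequences are hypothetical, primers are assumed to anneal by exact match, and only the strand on which the forward primer sequence appears is searched.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def amplicon_length(template, forward_primer, reverse_primer):
    """Length of the product bounded by the two primers, or None if absent."""
    start = template.find(forward_primer)
    if start == -1:
        return None
    # The reverse primer anneals to the opposite strand, so its binding site
    # appears on the template as the reverse complement of the primer.
    reverse_site = reverse_complement(reverse_primer)
    end = template.find(reverse_site, start + len(forward_primer))
    if end == -1:
        return None
    return end + len(reverse_site) - start


def compare_to_normal(normal, sample, forward_primer, reverse_primer):
    """Classify a sample by amplified product size relative to the normal."""
    n = amplicon_length(normal, forward_primer, reverse_primer)
    s = amplicon_length(sample, forward_primer, reverse_primer)
    if n is None or s is None:
        return ("no product", None)
    if s < n:
        return ("deletion", n - s)
    if s > n:
        return ("insertion", s - n)
    return ("same size", 0)


# Hypothetical primers and templates, not sequences of the invention.
FORWARD = "GGATCCAAGCTT"
REVERSE = reverse_complement("GAATTCCTGCAG")
NORMAL = FORWARD + "ACGTACGTAC" + "GAATTCCTGCAG"
DELETED = FORWARD + "ACGTAC" + "GAATTCCTGCAG"
RESULT = compare_to_normal(NORMAL, DELETED, FORWARD, REVERSE)
```

A result of "same size" does not exclude a point mutation, since a base substitution leaves the length of the amplified product unchanged.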

Point mutations can be identified by hybridizing amplified DNA to
radiolabeled RNA of the invention or alternatively, radiolabeled antisense DNA
sequences of the invention. Sequence changes at specific locations may also be
revealed by nuclease protection assays, such as RNase and S1 protection or the
chemical cleavage method (e.g. Cotton et al., Proc. Natl. Acad. Sci. USA,
85:4397-4401, (1985)), or by differences in melting temperatures. "Molecular
beacons" (Kostrikis, L.G. et al., Science 279:1228-1229, (1998)), hairpin-shaped,
single-stranded synthetic oligonucleotides containing probe sequences which
are complementary to the nucleic acid of the present invention, may also be used
to detect point mutations or other sequence changes as well as monitor expression
levels of variant product. Such diagnostics would be particularly useful for
prenatal testing.

Another method for detecting mutations uses two DNA probes which are
designed to hybridize to adjacent regions of a target, with abutting bases, where
the region of known or suspected mutation(s) is at or near the abutting bases.
The two probes may be joined at the abutting bases, e.g., in the presence of a
ligase enzyme, but only if both probes are correctly base paired in the region of
probe junction. The presence or absence of mutations is then detectable by the
presence or absence of ligated probe.
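
The logic of the ligation-based method can be modeled as follows. This is a simplified sketch, not a description of ligase behavior: the sequences are hypothetical, the probes are written in the sense orientation (so that they anneal to the antisense strand of the sample and perfect pairing is equivalent to identity with the sample's sense strand), and the number of junction-proximal bases that must pair (`window`) is an assumed parameter.

```python
def probes_ligate(sample_sense, left_probe, right_probe, junction, window=3):
    """Return True if both probes are paired next to the probe junction.

    junction is the index in sample_sense at which the right probe begins.
    """
    if junction < len(left_probe):
        return False
    left_site = sample_sense[junction - len(left_probe):junction]
    right_site = sample_sense[junction:junction + len(right_probe)]
    if len(right_site) != len(right_probe):
        return False
    # Only the bases adjacent to the junction are required to pair here.
    left_ok = left_probe[-window:] == left_site[-window:]
    right_ok = right_probe[:window] == right_site[:window]
    return left_ok and right_ok


# Hypothetical reference and a sample with a point mutation at index 7,
# i.e. at the last base covered by the left probe.
REFERENCE = "ACGTTGCAAGGCTTAC"
MUTANT = "ACGTTGCGAGGCTTAC"
JUNCTION = 8
LEFT = REFERENCE[:JUNCTION]
RIGHT = REFERENCE[JUNCTION:]

NORMAL_LIGATED = probes_ligate(REFERENCE, LEFT, RIGHT, JUNCTION)
MUTANT_LIGATED = probes_ligate(MUTANT, LEFT, RIGHT, JUNCTION)
```

In this model the normal sample yields ligated probe and the mutant sample does not, which corresponds to the presence or absence of ligated probe described above.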

Also suitable for detecting mutations in the variant product coding
sequence are oligonucleotide array methods based on sequencing by
hybridization (SBH), as described, for example, in U.S. Patent No. 5,547,839. In
a typical method, the DNA target analyte is hybridized with an array of
oligonucleotides formed on a microchip. The sequence of the target can then be
"read" from the pattern of target binding to the array.

D. Gene mapping utilizing nucleic acid sequences 

The nucleic acid sequences of the present invention are also valuable for
chromosome identification. The sequence is specifically targeted to and can
hybridize with a particular location on an individual human chromosome.
Moreover, there is a current need for identifying particular sites on the
chromosome. Few chromosome marking reagents based on actual sequence data
(repeat polymorphisms) are presently available for marking chromosomal
location. The mapping of DNAs to chromosomes according to the present
invention is an important first step in correlating those sequences with genes
associated with disease.

Briefly, sequences can be mapped to chromosomes by preparing PCR
primers (preferably 20-30 bp) from the variant cDNA. Computer analysis of the
3' untranslated region is used to rapidly select primers that do not span more than
one exon in the genomic DNA, which would complicate the amplification
process. These primers are then used for PCR screening of somatic cell hybrids
containing individual human chromosomes. Only those hybrids containing the
human gene corresponding to the primer will yield an amplified fragment.

PCR mapping of somatic cell hybrids, or alternatively of radiation hybrids,
is a rapid procedure for assigning a particular DNA to a particular chromosome.
Using the present invention, with the same oligonucleotide primers,
sublocalization can be achieved with panels of fragments from specific
chromosomes or pools of large genomic clones in an analogous manner. Other
mapping strategies that can similarly be used to map a sequence to its chromosome
include in situ hybridization, prescreening with labeled flow-sorted chromosomes
and preselection by hybridization to construct chromosome-specific cDNA libraries.

Fluorescence in situ hybridization (FISH) of a cDNA clone to a metaphase
chromosomal spread can be used to provide a precise chromosomal location in
one step. This technique can be used with cDNA as short as 50 or 60 bases. For a
review of this technique, see Verma et al., Human Chromosomes: a Manual of
Basic Techniques, (1988) Pergamon Press, New York.

Once a sequence has been mapped to a precise chromosomal location, the
physical position of the sequence on the chromosome can be correlated with
genetic map data. Such data are found, for example, in the OMIM database
(Center for Medical Genetics, Johns Hopkins University, Baltimore, MD and
National Center for Biotechnology Information, National Library of Medicine,
Bethesda, MD). The OMIM gene map presents the cytogenetic map location of
disease genes and other expressed genes. The OMIM database provides
information on diseases associated with the chromosomal location. Such
associations include the results of linkage analysis mapped to this interval, and
the correlation of translocations and other chromosomal aberrations in this area
with the advent of polygenic diseases, such as cancer in general and prostate
cancer in particular.

E. Therapeutic applications of nucleic acid sequences

Nucleic acid sequences of the invention may also be used for therapeutic
purposes. Turning first to the second aspect of the invention (i.e. inhibition of
expression of variant), expression of variant product may be modulated through
antisense technology, which controls gene expression through hybridization of
complementary nucleic acid sequences, i.e. antisense DNA or RNA, to the
control, 5' or regulatory regions of the gene encoding variant product. For
example, the 5' coding portion of the nucleic acid sequence which
codes for the product of the present invention is used to design an antisense
oligonucleotide of from about 10 to 40 base pairs in length. Oligonucleotides
derived from the transcription start site, e.g. between positions -10 and +10 from
the start site, are preferred. An antisense DNA oligonucleotide is designed to be
complementary to a region of the nucleic acid sequence involved in transcription
(Lee et al., Nucl. Acids Res., 6:3073, (1979); Cooney et al., Science 241:456,
(1988); and Dervan et al., Science 251:1360, (1991)), thereby preventing
transcription and the production of the variant products. An antisense RNA
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the
mRNA molecule into the variant products (Okano, J. Neurochem. 56:560,
(1991)). The antisense constructs can be delivered to cells by procedures known
in the art such that the antisense RNA or DNA may be expressed in vivo. The
antisense may be antisense mRNA or DNA sequence capable of coding such
antisense mRNA. The antisense mRNA or the DNA coding thereof can be
complementary to the full sequence of nucleic acid sequences coding for the
variant protein or to a fragment of such a sequence which is sufficient to inhibit
production of a protein product.

Turning now to the first aspect of the invention, i.e. expression of variant,
expression of variant product may be increased by providing coding sequences
coding for said product under the control of suitable control elements enabling
its expression in the desired host.

The nucleic acid sequences of the invention may be employed in
combination with a suitable pharmaceutical carrier. Such compositions comprise
a therapeutically effective amount of the compound, and a pharmaceutically
acceptable carrier or excipient. Such a carrier includes, but is not limited to, saline,
buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The
formulation should suit the mode of administration.
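
The design of an antisense oligonucleotide spanning positions -10 to +10 relative to a start site can be sketched as follows. The sense-strand sequence and the start-site index are hypothetical, and the example addresses only sequence complementarity; it does not evaluate specificity, secondary structure or delivery.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def antisense_oligo(sense_strand, start_index, upstream=10, downstream=10):
    """Antisense DNA oligonucleotide covering -upstream..+downstream.

    start_index is the zero-based index of position +1 on the sense strand;
    there is no position 0, so the window holds upstream + downstream bases.
    """
    begin = start_index - upstream
    end = start_index + downstream
    if begin < 0 or end > len(sense_strand):
        raise ValueError("window extends beyond the supplied sequence")
    return reverse_complement(sense_strand[begin:end])


# Hypothetical sense strand and start site, for illustration only.
SENSE = "TTGACAGCTAGCTCAGTCCTAGGTATAATGCTAGC"
START = 15
OLIGO = antisense_oligo(SENSE, START)
```

With the default window the oligonucleotide is 20 bases long, which falls within the range of about 10 to 40 bases stated above.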

The products of the invention, as well as any activator and deactivator
compounds (see below) which are polypeptides, may also be employed in
accordance with the present invention by expression of such polypeptides in vivo,
which is often referred to as "gene therapy." Cells from a patient may be
engineered with a nucleic acid sequence (DNA or RNA) encoding a polypeptide
ex vivo, with the engineered cells then being provided to a patient to be treated
with the polypeptide. Such methods are well-known in the art. For example, cells
may be engineered by procedures known in the art by use of a retroviral particle
containing RNA encoding a polypeptide of the present invention.

Similarly, cells may be engineered in vivo for expression of a polypeptide
in vivo by procedures known in the art. As known in the art, a producer cell for
producing a retroviral particle containing RNA encoding the polypeptide of the
present invention may be administered to a patient for engineering cells in vivo
and expression of the polypeptide in vivo. These and other methods for
administering a product of the present invention by such method should be
apparent to those skilled in the art from the teachings of the present invention.
For example, the expression vehicle for engineering cells may be other than a
retrovirus, for example, an adenovirus which may be used to engineer cells in
vivo after combination with a suitable delivery vehicle.

Retroviruses from which the retroviral plasmid vectors mentioned above
may be derived include, but are not limited to, Moloney Murine Leukemia Virus,
spleen necrosis virus, retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma
Virus, avian leukosis virus, gibbon ape leukemia virus, human immunodeficiency
virus, adenovirus, Myeloproliferative Sarcoma Virus, and mammary tumor virus.

The retroviral plasmid vector is employed to transduce packaging cell
lines to form producer cell lines. Examples of packaging cells which may be
transfected include, but are not limited to, the PE501, PA317, psi-2, psi-AM,
PA12, T19-14X, VT-19-17-H2, psi-CRE, psi-CRIP, GP+E-86, GP+envAm12,
and DAN cell lines as described in Miller (Human Gene Therapy, Vol. 1,
pp. 5-14, (1990)). The vector may transduce the packaging cells through any
means known in the art. Such means include, but are not limited to,
electroporation, the use of liposomes, and CaPO4 precipitation. In one alternative,
the retroviral plasmid vector may be encapsulated into a liposome, or coupled to a
lipid, and then administered to a host.

The producer cell line generates infectious retroviral vector particles
which include the nucleic acid sequence(s) encoding the polypeptides. Such
retroviral vector particles then may be employed to transduce eukaryotic cells,
either in vitro or in vivo. The transduced eukaryotic cells will express the nucleic
acid sequence(s) encoding the polypeptide. Eukaryotic cells which may be
transduced include, but are not limited to, embryonic stem cells, embryonic
carcinoma cells, as well as hematopoietic stem cells, hepatocytes, fibroblasts,
myoblasts, keratinocytes, endothelial cells, and bronchial epithelial cells.

The genes introduced into cells may be placed under the control of 
inducible promoters, such as the radiation-inducible Egr-1 promoter (Maceri, 
H.J., et al., Cancer Res., 56(19):4311 (1996)), to stimulate variant production or 
antisense inhibition in response to radiation, e.g., radiation therapy for treating 
tumors. 

Example IV. Variant product 

The substantially purified variant product of the invention has been 
defined above as the product coded from the nucleic acid sequence of the 
invention. Preferably the amino acid sequence is an amino acid sequence having 
at least 90% identity, preferably 95% identity, to any one of the sequences 
identified as SEQ ID NO: 27 to SEQ ID NO: 52, provided that the amino acid 
sequence is not identical to that of the original sequence from which it has been 
varied. The protein or polypeptide may be in mature and/or modified form, also 
as defined above. Also contemplated are protein fragments having at least 10 
contiguous amino acid residues, preferably at least 10-20 residues, derived from 
the variant product, as well as homologues as explained above. 
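
The identity and fragment criteria in the preceding paragraph can be illustrated with a minimal sketch. It assumes that the two sequences have already been aligned to equal length (with '-' marking gaps) and that identity is counted as identical columns divided by aligned columns; the definition of identity given earlier in the specification may differ. The function names and the toy peptide strings are hypothetical and are not taken from the sequence listing.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment.

    Assumption: both strings are pre-aligned to equal length and '-' marks
    a gap; identity = identical columns / aligned columns * 100.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Skip columns that are gaps in both sequences.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def qualifies_as_variant(candidate: str, reference: str, original: str,
                         threshold: float = 90.0) -> bool:
    """True if the candidate is at least `threshold` percent identical to the
    reference and is not identical to the original sequence."""
    not_original = candidate.replace("-", "") != original.replace("-", "")
    return percent_identity(candidate, reference) >= threshold and not_original


def is_contiguous_fragment(fragment: str, parent: str, min_len: int = 10) -> bool:
    """True if `fragment` is a contiguous stretch of `parent` of at least
    `min_len` residues."""
    return len(fragment) >= min_len and fragment in parent


if __name__ == "__main__":
    print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
    print(is_contiguous_fragment("DEFGHIKLMN", "ACDEFGHIKLMNPQ"))
```

The thresholds are parameters, so the preferred 95% identity level or a longer minimum fragment length can be checked by changing `threshold` or `min_len`.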

The sequence variations are preferably those that are considered conserved 
substitutions, as defined above. Thus, for example, a protein with a sequence 
having at least 90% identity, preferably 95% sequence identity, with any of the 
products identified as SEQ ID NO: 27 to SEQ ID NO: 52, preferably by utilizing 
conserved substitutions as defined above, is also part of the invention, 
provided that it is not identical to the original peptide from which it has been 
varied. In a more specific embodiment, the protein has or contains any one of the 
sequences identified as SEQ ID NO: 27 to SEQ ID NO: 52. The variant product 
may be (i) one in which one or more of the amino acid residues in a sequence 
listed above are substituted with a conserved or non-conserved amino acid 
residue (preferably a conserved amino acid residue), or (ii) one in which one or 
more of the amino acid residues includes a substituent group, or (iii) one in which 
the variant product is fused with another compound, such as a compound to 
increase the half-life of the protein (for example, polyethylene glycol (PEG)), or a 
moiety which serves as targeting means to direct the protein to its target tissue or 
target cell population (such as an antibody), or (iv) one in which additional amino 
acids are fused to the variant product. Such fragments, variants and derivatives 
are deemed to be within the scope of those skilled in the art from the teachings 
herein. 

A. Preparation of variant product 

Recombinant methods for producing and isolating the variant product, and 
fragments of the protein, are described above. 

In addition to recombinant production, fragments and portions of variant 
product may be produced by direct peptide synthesis using solid-phase techniques 
(cf. Stewart et al., (1969) Solid-Phase Peptide Synthesis, WH Freeman Co., San 
Francisco; Merrifield J., J. Am. Chem. Soc., 85:2149-2154, (1963)). In vitro 
peptide synthesis may be performed using manual techniques or by automation. 
Automated synthesis may be achieved, for example, using Applied Biosystems 
431A Peptide Synthesizer (Perkin Elmer, Foster City, Calif.) in accordance with 
the instructions provided by the manufacturer. Fragments of variant product may 
be chemically synthesized separately and combined using chemical methods to 
produce the full length molecule. 

B. Therapeutic uses and compositions utilizing the variant product 

The variant product of the invention is generally useful in treating diseases 
and disorders which are characterized by a lower than normal level of variant 
expression, and/or diseases which can be cured or ameliorated by raising the level 
of the variant product, even if the level is normal. 

Variant products or fragments may be administered by any of a number of 
routes and methods designed to provide a consistent and predictable 
concentration of compound at the target organ or tissue. The product-containing 
compositions may be administered alone or in combination with other agents, 
such as stabilizing compounds, and/or in combination with other pharmaceutical 
agents such as drugs or hormones. 

Variant product-containing compositions may be administered by a 
number of routes including, but not limited to, oral, intravenous, intramuscular, 
transdermal, subcutaneous, topical, sublingual, or rectal means, as well as by nasal 
application. Variant product-containing compositions may also be administered 
via liposomes. Such administration routes and appropriate formulations are 
generally known to those of skill in the art. 

The product can be given via intravenous or intraperitoneal injection. 
Similarly, the product may be injected to other localized regions of the body. The 
product may also be administered via nasal insufflation. Enteral administration is 
also possible. For such administration, the product should be formulated into an 
appropriate capsule or elixir for oral administration, or into a suppository for 
rectal administration. 

The foregoing exemplary administration modes will likely require that the 
product be formulated into an appropriate carrier, including ointments, gels, and 
suppositories. Appropriate formulations are well known to persons skilled in the 
art. 

Dosage of the product will vary, depending upon the potency and 
therapeutic index of the particular polypeptide selected. 

A therapeutic composition for use in the treatment method can include the 
product in a sterile injectable solution, the polypeptide in an oral delivery vehicle, 
the product in an aerosol suitable for nasal administration, or the product in a 
nebulized form, all prepared according to well known methods. Such 
compositions comprise a therapeutically effective amount of the compound, and a 
pharmaceutically acceptable carrier or excipient. Such a carrier includes but is not 
limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and 
combinations thereof. The product of the invention may also be used to modulate 
endothelial differentiation and proliferation as well as to modulate apoptosis 
either ex vivo or in vitro, for example, in cell cultures. 

Example V. Screening methods for activators and deactivators (inhibitors) 

The present invention also includes an assay for identifying molecules, 
such as synthetic drugs, antibodies, peptides, or other molecules, which have a 
modulating effect on the activity of the variant product, e.g. activators or 
deactivators of the variant product of the present invention. Such an assay 
comprises the steps of providing a variant product encoded by the nucleic acid 
sequences of the present invention, contacting the variant protein with one or 
more candidate molecules to determine each candidate molecule's modulating 
effect on the activity of the variant product, and selecting from the molecules a 
candidate molecule capable of modulating variant product physiological 
activity. 

The variant product, its catalytic or immunogenic fragments or 
oligopeptides thereof, can be used for screening therapeutic compounds in any of 
a variety of drug screening techniques. The fragment employed in such a test 
may be free in solution, affixed to a solid support, borne on a cell membrane or 
located intracellularly. The formation of binding complexes between variant 
product and the agent being tested may be measured. Alternatively, the activator 
or deactivator may work by serving as agonist or antagonist, respectively, of the 
variant receptor, binding entity or target site, and their effect may be determined 
in connection with any of the above. 

Another technique for drug screening which may be used provides for 
high throughput screening of compounds having suitable binding affinity to the 
variant product, and is described in detail by Geysen in PCT Application WO 
84/03564, published on Sep. 13, 1984. In summary, large numbers of different 
small peptide test compounds are synthesized on a solid substrate, such as plastic 
pins or some other surface. The peptide test compounds are reacted with the full 
variant product or with fragments of variant product and washed. Bound variant 
product is then detected by methods well known in the art. Substantially purified 
variant product can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, non-neutralizing 
antibodies can be used to capture the peptide and immobilize it on a solid 
support. 

Antibodies to the variant product, as described in Example VI below, may 
also be used in screening assays according to methods well known in the art. For 
example, a "sandwich" assay may be performed, in which an anti-variant 
antibody is affixed to a solid surface such as a microtiter plate and variant 
product is added. Such an assay can be used to capture compounds which bind to 
the variant product. Alternatively, such an assay may be used to measure the 
ability of compounds to influence the binding of variant product to the 
variant receptor, and then select those compounds which affect the binding. 

Example VI. Anti-variant antibodies 
A. Synthesis 

In still another aspect of the invention, the purified variant product is used 
to produce anti-variant antibodies which have diagnostic and therapeutic uses 
related to the activity, distribution, and expression of the variant product. 

Antibodies to the variant product may be generated by methods well 
known in the art. Such antibodies may include, but are not limited to, polyclonal, 
monoclonal, chimeric, humanized, single chain, Fab fragments and fragments 
produced by an Fab expression library. Neutralizing antibodies, i.e., those which 
inhibit dimer formation, are especially preferred for therapeutic use. 

A fragment of the variant product for antibody induction does not require 
biological activity but has to feature immunological activity; that is, the 
protein fragment or oligopeptide must be antigenic. Peptides used to induce 
specific antibodies may have an amino acid sequence consisting of at least five 
amino acids, preferably at least 10 amino acids, of the sequences specified in any 
one of SEQ ID NO: 27 to SEQ ID NO: 52. Preferably they should mimic a 
portion of the amino acid sequence of the natural protein and may contain the 
entire amino acid sequence of a small, naturally occurring molecule. Short 
stretches of variant protein amino acids may be fused with those of another 
protein such as keyhole limpet hemocyanin and antibody produced against the 
chimeric molecule. Procedures well known in the art can be used for the 
production of antibodies to variant product. 

For the production of antibodies, various hosts including goats, rabbits, 
rats, mice, etc., may be immunized by injection with variant product or any 
portion, fragment or oligopeptide which retains immunogenic properties. 
Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include but are not limited to Freund's, 
mineral gels such as aluminum hydroxide, and surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, and dinitrophenol. BCG (bacilli Calmette-Guerin) and 
Corynebacterium parvum are potentially useful human adjuvants. 

Monoclonal antibodies to variant protein may be prepared using any 
technique which provides for the production of antibody molecules by continuous 
cell lines in culture. These include but are not limited to the hybridoma technique 
originally described by Koehler and Milstein (Nature 256:495-497, (1975)), the 
human B-cell hybridoma technique (Kosbor et al., Immunol. Today 4:72, (1983); 
Cote et al., Proc. Natl. Acad. Sci. 80:2026-2030, (1983)) and the EBV-hybridoma 
technique (Cole et al., Mol. Cell Biol. 62:109-120, (1984)). 

Techniques developed for the production of "chimeric antibodies", the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity, can also be used 
(Morrison et al., Proc. Natl. Acad. Sci. 81:6851-6855, (1984); Neuberger et al., 
Nature 312:604-608, (1984); Takeda et al., Nature 314:452-454, (1985)). 
Alternatively, techniques described for the production of single chain antibodies 
(U.S. Pat. No. 4,946,778) can be adapted to produce single-chain antibodies 
specific for the variant protein. 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening recombinant immunoglobulin libraries or 
panels of highly specific binding reagents as disclosed in Orlandi et al. (Proc. 
Natl. Acad. Sci. 86:3833-3837, (1989)), and Winter G. and Milstein C. (Nature 
349:293-299, (1991)). 

Antibody fragments which contain specific binding sites for variant 
protein may also be generated. For example, such fragments include, but are not 
limited to, the F(ab')2 fragments which can be produced by pepsin digestion of 
the antibody molecule and the Fab fragments which can be generated by reducing 
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of monoclonal 
Fab fragments with the desired specificity (Huse W.D. et al., Science 
256:1275-1281, (1989)). 

B. Diagnostic applications of antibodies 

A variety of protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established 
specificities are well known in the art. Such immunoassays typically involve the 
formation of complexes between the variant product and its specific antibody and 
the measurement of complex formation. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two noninterfering 
epitopes on a specific variant product is preferred, but a competitive binding 
assay may also be employed. These assays are described in Maddox D.E., et al. 
(J. Exp. Med. 158:1211, (1983)). 

Antibodies which specifically bind variant product are useful for the 
diagnosis of conditions or diseases characterized by expression of the novel 
variant of the invention (where normally it is not expressed) or by over- or 
under-expression of the variant, as well as for detection of diseases in which the 
proportion between the amount of the variants of the invention and the original 
sequence from which it varied is altered. Alternatively, such antibodies may be used in 
assays to monitor patients being treated with variant product, its activators, or its 
deactivators. Diagnostic assays for variant protein include methods utilizing the 
antibody and a label to detect variant product in human body fluids or extracts of 
cells or tissues. The products and antibodies of the present invention may be 
used with or without modification. Frequently, the proteins and antibodies will be 
labeled by joining them, either covalently or noncovalently, with a reporter 
molecule. A wide variety of reporter molecules are known in the art. 
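
The proportion between the amount of a variant and the amount of the original sequence, mentioned above as a diagnostic quantity, can be illustrated with a short sketch. It assumes that each assay yields a raw signal and that both signals are normalized to a shared reference signal before the ratio is taken; the normalization scheme, the function names, and the example numbers are hypothetical and are not specified in the text.

```python
def normalized_level(signal: float, reference_signal: float) -> float:
    """Normalize a raw assay signal to a reference signal (assumed scheme)."""
    if reference_signal <= 0:
        raise ValueError("reference signal must be positive")
    return signal / reference_signal


def variant_to_original_ratio(variant_signal: float,
                              original_signal: float,
                              reference_signal: float) -> float:
    """Ratio of the normalized variant level to the normalized original level."""
    variant_level = normalized_level(variant_signal, reference_signal)
    original_level = normalized_level(original_signal, reference_signal)
    if original_level == 0:
        raise ValueError("original level is zero; ratio is undefined")
    return variant_level / original_level


if __name__ == "__main__":
    # Hypothetical raw signals measured in one sample.
    print(variant_to_original_ratio(50.0, 200.0, 100.0))
```

When both levels are measured in the same sample against the same reference, the reference cancels out of the ratio; it is kept explicit here because the two levels may come from different samples.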



A variety of protocols for measuring the variant product, using either 
polyclonal or monoclonal antibodies specific for the respective protein, are known 
in the art. Examples include enzyme-linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA), and fluorescent activated cell sorting (FACS). As 
noted above, a two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on variant product is 
preferred, but a competitive binding assay may be employed. These assays are 
described, among other places, in Maddox, et al. (supra). Such protocols provide 
a basis for diagnosing altered or abnormal levels of variant product expression. 
Normal or standard values for variant product expression are established by 
combining body fluids or cell extracts taken from normal subjects, preferably 
human, with antibody to variant product under conditions suitable for complex 
formation which are well known in the art. The amount of standard complex 
formation may be quantified by various methods, preferably by photometric 
methods. Then, standard values obtained from normal samples may be compared 
with values obtained from samples from subjects potentially affected by disease. 
Deviation between standard and subject values establishes the presence of disease 
state. 
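
The comparison of subject values against standard values described above can be sketched as follows. The text does not state how large a deviation must be, so this sketch assumes a reference range of the mean of the normal samples plus or minus two sample standard deviations; that cutoff, the function names, and the example readings are hypothetical.

```python
from statistics import mean, stdev


def reference_range(normal_values, n_sd=2.0):
    """Return (low, high) as mean +/- n_sd sample standard deviations.

    Assumption: the 2-SD window is an illustrative choice, not a value
    taken from the specification.
    """
    if len(normal_values) < 2:
        raise ValueError("need at least two normal samples")
    centre = mean(normal_values)
    spread = stdev(normal_values)
    return centre - n_sd * spread, centre + n_sd * spread


def classify(subject_value, normal_values, n_sd=2.0):
    """Label a subject value as 'below', 'within', or 'above' the range."""
    low, high = reference_range(normal_values, n_sd)
    if subject_value < low:
        return "below"
    if subject_value > high:
        return "above"
    return "within"


if __name__ == "__main__":
    # Hypothetical photometric readings from normal subjects.
    normals = [1.0, 1.1, 0.9, 1.0, 1.0]
    print(classify(1.5, normals))
```

A value labeled "above" or "below" corresponds to the deviation between standard and subject values that the paragraph treats as indicative; a different cutoff can be tried by changing `n_sd`.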

The antibody assays are useful to determine the level of variant product 
present in a body fluid sample, in order to determine whether it is being 
expressed at all, whether it is being overexpressed or underexpressed in the 
tissue, or as an indication of how levels of the variant product are 
responding to drug treatment. 

By another aspect, the invention concerns methods for determining the 
presence or level of various anti-variant antibodies in a biological sample 
obtained from patients, such as a blood or serum sample, using as an antigen the 
variant product. Determination of said antibodies may be indicative of a plurality 
of pathological conditions or diseases. 

C. Therapeutic uses of antibodies 

In addition to their diagnostic use, the antibodies may have a therapeutic 
utility in blocking or decreasing the activity of the variant product in pathological 
conditions where a beneficial effect can be achieved by such a decrease. 

The antibody employed is preferably a humanized monoclonal antibody, 
or a human Mab produced by known globulin-gene library methods. The 
antibody is administered typically as a sterile solution by IV injection, although 
other parenteral routes may be suitable. Typically, the antibody is administered 
in an amount between about 1-15 mg/kg body weight of the subject. Treatment is 
continued, e.g., with dosing every 1-7 days, until a therapeutic improvement is 
seen. 

Although the invention has been described with reference to specific 
methods and embodiments, it is appreciated that various modifications and 
changes may be made without departing from the invention. 

CLAIMS: 

1. An isolated nucleic acid sequence of an alternative splicing variant, selected 
from the group consisting of: 

(i) the nucleic acid sequence depicted in any one of SEQ ID NO: 1 to 
SEQ ID NO: 26; 

(ii) nucleic acid sequences having at least 90% identity with the 
sequence of (i), with the proviso that each sequence is different than the original 
nucleic acid sequence from which the sequences of (i) have been varied by 
alternative splicing; and 

(iii) fragments of (i) or (ii) of at least 20 b.p., provided that said fragment 
contains a sequence which is not present, as a continuous stretch of nucleotides, in 
the original nucleic acid sequence from which the sequences of (i) have been 
varied by alternative splicing. 

2. An isolated nucleic acid sequence according to Claim 1, having at least 95% 
identity to any one of SEQ ID NO: 1 to SEQ ID NO: 26. 

3. An isolated nucleic acid sequence complementary to the nucleic acid 
sequence of Claim 1. 

4. An amino acid sequence selected from the group consisting of: 

(i) an amino acid sequence coded by the isolated nucleic acid sequence 
of alternative splice variants of Claim 1; 

(ii) homologues of the amino acid sequences of (i) in which one or more 
amino acids has been added, deleted, replaced or chemically 
modified in the region or adjacent to the region where the amino 
acid sequence differs from the original amino acid sequence, coded 
by the original nucleic acid sequence from which the variant has been 
varied. 

5. An amino acid sequence according to Claim 4, as depicted in any one of 
SEQ ID NO: 27 to SEQ ID NO: 52. 

6. An isolated nucleic acid sequence coding for any one of the amino acid 
sequences of Claim 5. 

7. A purified antibody which binds specifically to any of the amino acid 
sequences of Claim 4. 

8. An expression vector comprising any one of the nucleic acid sequences of 
Claim 1 and control elements for the expression of the nucleic acid sequence in a 
suitable host. 

9. An expression vector comprising any one of the nucleic acid sequences of 
Claim 3, and control elements for the expression of the nucleic acid sequences in a 
suitable host. 

10. A host cell transfected by the expression vector of Claim 8. 

11. A host cell transfected by the expression vector of Claim 9. 

12. A pharmaceutical composition comprising a pharmaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

(i) the expression vector of Claim 8; and 

(ii) any one of the amino acid sequences of Claim 4. 

13. A pharmaceutical composition according to Claim 12, for treatment of 
diseases which can be ameliorated or cured by raising the level of any one of the 
amino acid sequences depicted in SEQ ID NO: 27 to SEQ ID NO: 52. 

14. A pharmaceutical composition comprising a pharmaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

(i) any one of the nucleic acid sequences of Claim 3; 

(ii) the expression vector of Claim 9; and 

(iii) the purified antibody of Claim 7. 

15. A pharmaceutical composition according to Claim 14, for treatment of 
diseases which can be ameliorated or cured by decreasing the level of any one of 
the amino acid sequences depicted in SEQ ID NO: 27 to SEQ ID NO: 52. 

16. A method for detecting a variant nucleic acid sequence in a biological 
sample, comprising the steps of: 

(a) hybridizing to nucleic acid material of said biological sample any one 
of the nucleic acid sequences of Claim 1; and 

(b) detecting said hybridization complex; 

wherein the presence of said hybridization complex correlates with the 
presence of a variant nucleic acid sequence in the said biological sample. 

17. A method for detecting a variant nucleic acid sequence in a biological 
sample, comprising the steps of: 

(a) hybridizing to nucleic acid material of said biological sample any one 
of the nucleic acid sequences of Claim 3; and 

(b) detecting said hybridization complex; 

wherein the presence of said hybridization complex correlates with the 
presence of a variant nucleic acid sequence in the said biological sample. 

18. A method for determining the level of variant nucleic acid sequences in a 
biological sample comprising the steps of: 

(a) hybridizing to nucleic acid material of said biological sample any one 
of the nucleic acid sequences of Claim 1; and 

(b) determining the amount of hybridization complexes and normalizing 
said amount to provide the level of the variant nucleic acid sequences in the 
sample. 

19. A method for determining the level of variant nucleic acid sequences in a 
biological sample comprising the steps of: 

(a) hybridizing to nucleic acid material of said biological sample any one 
of the nucleic acid sequences of Claim 3; and 

(b) determining the amount of hybridization complexes and normalizing 
said amount to provide the level of the variant nucleic acid sequences in the 
sample. 

20. A method for determining the ratio between the level of variant of the 
nucleic acid sequence in a first biological sample and the level of the original 
sequence from which the variant has been varied by alternative splicing in a second 
biological sample comprising: 

(a) determining the level of the variant nucleic acid sequence in the first 
biological sample according to the method of Claim 19; 

(b) determining the level of the original sequence in the second biological 
sample; and 

(c) comparing the levels obtained in (a) and (b) to give said ratio. 

21. A method according to Claim 20, wherein said first and said second 
biological samples are the same sample. 

22. A method according to Claim 20, wherein the nucleic acid material of said 
biological sample are mRNA transcripts. 

23. A method according to Claim 22, where the nucleic acid sequence is present 
in a nucleic acid chip. 

24. A method for identifying candidate compounds capable of binding to the 
variant product and modulating its activity, the method comprising: 

(i) providing any one of the amino acid sequences as defined in Claim 4; 

(ii) contacting a candidate compound with said amino acid sequence; 

(iii) determining the effect of said candidate compound on the biological 
activity of said protein or polypeptide and selecting those compounds which show a 
significant effect on said biological activity. 

25. A method according to Claim 24, wherein the compound is an activator and 
the measured effect is increase in the biological activity. 

26. A method according to Claim 24, wherein the compound is a deactivator 
and the effect is decrease in the biological activity. 

27. An activator of any one of the amino acid sequences of Claim 4. 

28. A deactivator of any one of the amino acid sequences of Claim 4. 

29. A method for detecting any one of the amino acid sequences of Claim 4 in a 
biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of Claim 7, 
thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of the desired amino acid in said biological sample. 

30. A method for detecting the level of any one of the amino acid sequences of 
Claim 4 in a biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of Claim 7, 
thereby forming an antibody-antigen complex; and 

(b) detecting the amount of said antibody-antigen complex and 
normalizing said amount to provide the level of said amino acid sequence in the 
sample. 

31. A method for determining the ratio between the level of any one of the 
amino acid sequences of Claim 4 present in a first biological sample and the level 
of the original amino acid sequences from which they were varied by alternative 
splicing, present in a second biological sample, the method comprising: 

(a) determining the level of the amino acid sequences of Claim 4 in a 
first sample by the method of Claim 30; 

(b) determining the level of the original amino acid sequence in the 
second sample; and 

(c) comparing the level obtained in (a) and (b) to give said ratio. 

32. A method according to Claim 31, wherein said first and said second 
biological samples are the same sample. 

33. A method for detecting any one of the antibodies of Claim 7 in a biological 
sample comprising the steps of: 

(a) contacting said biological sample with any one of the amino acid 
sequences of Claim 4 thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of the antibody in said biological sample. 

34. A method for detecting the level of any one of the antibodies of Claim 7 in a 
biological sample comprising the steps of: 

(a) contacting said biological sample with any one of the amino acid 
sequences of Claim 4; 

(b) detecting the amount of said antibody-antigen complex and 
normalizing said amount to provide the levels of said antibody in the sample. 

ABSTRACT 

The present invention concerns novel variants produced by alternative 
splicing of known genes, amino acid sequences of the variants as well as their uses 
in detection and therapy. 



SEQUENCE LISTING 



<110> COMPUGEN LTD 

<120> VARIANTS OF ALTERNATIVE SPLICING 

<130> 1290352 - COMPUGEN 

<140> 
<141> 

<150> IL 132558 
<151> 1999-10-25 

<160> 52 

<170> PatentIn Ver. 2.1 

<210> 1 

<211> 1530 

<212> DNA 

<213> Homo sapiens 

<400> 1 

ceggccttga ggttttggca gctctggagg atgagagaga acatggccag gggcccttgc 60 
aaegcgccga gatgggtgtc cctgatggtg ctcgtcgcca taggcaccgc cgtgacagcg 120 
gccgtcaacc ctggcgtcgt ggtcaggatc tcccagaegg gcctggacta cgccagccag 180 
caggggacgg ccgctctgca gaaggagctg aagaggatca aqattcctga ctactcagac 240 
agctttaaga tcaagcatct tgggaagggg cattatagct tctacagcat ggacatccgt 300 
gaattccagc ttcccagttc ccagataagc atggtgc-cca atgtgggcct taagttctcc 360 
atcagcaacg ccaatotcaa gatcagcggg aaatggaagg cacaaaagag attcttaaae 420 
atgagcggca attttgacct gagcatagaa ggcatgtcca tttcggctga tctgaagctg 480 
ggcagtaacc ccacgtcagg eaagcccacc atcacctgct ccagctgcag cagccacatc 540 
oacagtgtcc acgtgcacot ctcaaagagc aaagtcgggt ggctgatcca actcttccac 600 
aaaaaaattg agtctgcgct tcgaaacaag atgaacagcc aggtctgcga gaaagtgacc 660 
aattctgtat cctctcadgct gcaaccttat ttccagactc tgccagtaat gaccaaaata 720 
gattctgtgg ctggaatcaa ctatggtctg gtggcacctc cagcaaccac ggctgagacc 780 
ctggatgtac agatgaaggg ggagttttac agtgagaacc accacaatcc acctcccttt 840 
gctccaccag tgatggagtt tcccgctgcc catgaccgca tggtatacct gggcctctca 900 
gactacttct tcaacacagc cgggcttgta taccaagagg ctggggtctt gaagatgacc 960 
cttagagatg acatgattcc aaaggagtcc aaatttcgac tgacaaccaa gttctttgga 1020 
accttcctac ctgaggt.ggc caagaagttt cccaacatga agatacagat ccatgtctca 1080 
gcctccaccc cgccacacct gtctgtgcag cccaccggcc ttaccttcta ccctgccgtg 1140 
gatgtccagg cctttgccgt cctccccaae tcctccctgg cttcectctt cctgattggc 1200 
atgytaagca gttcctgggt tggacagatg aggagcccca gacagtccca acagcactgt 1260 
ctttggagtc aggagoccat gtgaatcctg tctggattca aacctggact gtgtcactcc 1320 
ggagcctgag gcttgagtca ctgtactcaa tggtgccgae tcctggaggt attcattcac 1380 
ccagccattc actagtgcgt ttgtttactt attcattcaa ttattcattc agtcaatttc 1440 
toattr.attc anttattcat tccatgttgg cttgaaatat gtgtactgtn ccaattnatc 1500 
uatttatatc tttagtcatt caattatgca 1530 



<210> 2 
<211> 2251 
<212> DNA 

<213> Homo sapiens 
<400> 2 

ttttttcaqg ggtatgaggc tttattcagg aagagctcaa gatgggaggg gagcccctgg 60 
ggtcccactc ttgtggcetc ttgataccaa ggctcatggg acgctaaaac ctaatgactg 120 
caggcatctg agcttccttg gagtcatctg tggggggcac aggggcagcc tcegagtctac 180 
ccgtggggyg cacagggggg gccccggagt cacccgtggg cggcacgggg gcggtctcgg 240 
agtcacccgt agcggggatg gggcagcctc agtgtcaccc aaggtgggca egggcctcag 300 
agtcacccgt qggcggcacg gggggggecc cggagtcacc cgtgggcggc acgggggggg 360 
CCCcggagtc acccgtgggc ggcacggggg gggccccgga gtcacccgtg ggcggcacgg 420 
gggcggtctc ggagtcacco <jtggggggea cgggagtggc ctcggagtcc cctgtggggg 480 
gcacaygggt ggcctcetgg tcggtcactg tgggcagcgc cagataggtg agggtccagt 540 
agcgcaqgaa gttggttctc aggctccgct tcatggagct gctgcccatc ttcttggtga 600 
tctccraggta gccgctgttt tccgtagtgt agggttccca gtgtgtgggc acagccgagt 660 
cgcccatqtt ggggtcccct gttttggcaa agttggtcca gtaggcgatc atggccttag 720 
agactgtccL gtcttggggc egqtagcccg tgggggtggc gaagggcttc ccgaaaacgt 780 
actgaatgtc ntctgcatgg tcggccccca cccatttggg gtagacgggc atccgagagg 840 
gatgggaaaa caggtaggcg taggtcttgg cactcttggc attggctctg tgctgggcta 900 
gggcaatctc ggtgqgcacc aggaagagga catcggtctc aaagtccacc acagtctfcct 960 
tcttattctc ctgggatgqg tcctgggccc aggactcggt gtagacatca aaggtcgtct 1020 
tggcgcctct qagccccttg gtgattgtga actcactgac cagcttgtag aagtcctcct 1080 
ccgtgacttt cttgttgccc ttgttgatgg caggcatgtc gatgctggcg aagstgtggc 1140 
cgtccatgtt gttggtgcct gctatptagt cyatgtcggc ggcgttggcg tacaggttga 1200 
tcgggtc&gc ggggatgaag tctccatcaa tgacagggac gaagcccaca tagtgcagca 1260 
tggggtactc oaggcctqcc agcggcaccfc tataggccag cgtcagggct cggggatcag 1320 
taaccttcag aeactgggec atcctggcgg catcacccac agggcaaccc accttctcag 1380 
ccaccttttt ggoccagaag aqtgggtttt tctggatgac ccagggactc agggccacgc 1440 
cgctctggct gatggctcgc cggatgaggc ccttgttgta gggggagagg gtctgcagag 1500 
agacgctggc acctccagca gactccccga agagcgtgat gttgttgggg tcceccccga 1560 
aggccgcgat attcctcttc acccaaqcaa tggccatgtg ctgatcccga agaccatagt 1620 
tacctggcag ottggcgtce ccagtgctga ggaacccaag ggggccgaca cggtagttga 1680 
aggtgaccac gatgacgttt ccgcgtgtgg cgatctcctc gccgtcatac aggtagttgt 1740 
tgaggaagtt ggccccatqq ccggacccca tgaggaaggc gcctccatag atccagatca 1800 
taacgggcag qtcccgggag acttgcttcc tgccctgggg cacccaaatg ttgaggtaca 1860 
ggcagtcttc nt.ccccgtag gtgctgtcct gggtgatggt ggcctgcagg catctcttct 1920 
tgaagttctt ggccttcagg gtcccttgcc agccaggatg tggctgagga ttttccaggg 1980 
ccttggtggg agct.gcgaag gggatgccct tgaagatgtc cacagagtca cccaggaggc 2040 
cgagcttctt attgacgcct tccacgaacc caccttctgt gtacacggcg cccagcttcg 2100 
cggcactcqc cactgcccag cagcaggtga ggcccaacac aaccagttgc aggcgcccca 2160 
tggtgagcot cagcotctgg atgaataaac tctttgc-att atcttgagac tgctgttcct 2220 
tccatcttga cartt.tioctg atacagaagg t 2251 



<210> 3 
<211> 2078 
<212> DNA 

<213> Homo sapiens 
<400> 3 

ctgctgctcc tgggqttcct gctggtgagc ttggagtcaa cactttcgat tccaccttgg 60 
gaagccocoa aygagcataa gtacaaagct gaagagcaca cagtcgttct cactgtcacc 120 
ggggagccct gccacttccc cttccagtac caccggcagc tgtaccacaa atgtacccac 180 
aagggccggc caggccctca gccctggtgt gctaccaccc ccaactttga tcaggaccag 240 
cgatggggat actgtttgga gcccaagaaa gtgaaagacc actgcagcaa acacagcccc 300 
tgccagaaag gagggacctg tgtgaacatg ccaagcggcc cccactgtct ctgtccacaa 360 
cacctcactg gaaaccactg ccagaaagag aagtgctttg agcctcagct tctccggttt 420 
ttccacaaga atgagafcatg gtatagaact gagcaagcag ctgtggccag atgccagtgc 480 
aagggtccLg atgcccactg ccagcqgctg gccagccagg cctgccgcac caacccgtgc 540 
ctc^ritgggg gtcgctgccL ggaggtggag ggocaccgcc tgtgccactg cccggtgggc 600 
ta^accggac ccttctgcga cgtggacacc aaggcaagct gotatgatgg ccgcgggctc 660 
agctaccgcg ycctggccag gaccacyctc tcgggtgcgc cctgtcagcc gtgggcctcg 720 
gaggccacct accgg.iac.gt gactqccgag caagcgcgga actggggact gggcggccac 780 
gccttctgcc ggaacccgga caacgacatc cgcccgtggt gcttcgtgct gaaccgcgac 840 
cggctgagct ggqagtactg cgacctggca cagtgccaqa ccccaaccca ggcggcgcct 900 
ccgaccccqq tgtcccctag gcttcatgtc ccactcatgc ccgcgcagcc ggcaccgccg 960 
aagcctcagc ccacgacccq gaccccgcct cagtcccaga ccccgggagc cttgccggcg 1020 
aagcgggagc agccgccttc cctgaccaqg aacggcccac tgagctgcgg gcagcggctc 1080 






cgcaagagtc tgtcttcqat gacccgcgtc gttggcgggc tggtggcgct acgcggggcg 1140 

cacccctycci L egccgcgct gtactggggc cacagtttct gcgccggcag cctcatcgcc 1200 

ccctgctggg tgctgecggc cgctcaetgc etgcaggacc ggcccgcacc cgaggatctg 1260 

acggtggtqc tcggccagga acgccgtaac cacagctgtg agccgtgcca gacgttggcc 1320 

gtgcgctcct accgcttgca cgaggccttc tcgcccgtca gctaccagca cgacctggct 1380 

ctgttgcgcc ttcaggagga tgcggacggc agctgcgcgc tcctgtcgcc ttacgttcag 1440 

ccggtgtgcc tgccaagcgg cgccgcgcga ccctccgaga ccacgctctg ccaggtggcc 1500 

ggctggggcc accagttcga gggggcggag gaatatgcca gcttcctgca ggaggcgcag 1560 

gtaccgttcc tctccctqga gcgctgctca gccccggacg tgcacggatc ctccatcctc 1620 

cccggcatgc tctgcgcagg gttcctcgag ggcggcaccg atgcgtgcag gtgagctctt 1680 

agccggttqg cgcccttccc cgaggccgtc agcawaatct caggtccaca gcgctgactg 1740 

cgtgtttccg ncccagggtg attccggagg cccgctggtg tgtgaggacc aagctgcaga 1800 

gcgccggctc acuctgcaag gcatcatcag ctggggatcg ggctgtggtg accgcaacaa 1860 

gccaggcgte tacaccgatg tggcctacta cctggcctgg atccgggagc acaccgtttc 1920 

ctgattgctc agggactcat ctLtecctcc ttggtgattc cgcagtgaga gagtggctgg 1980 

ggeatggaag gcaagattgt gtcccattcc cccagtgcgg ccagctccgc gqcaggatgg 2040 

cgcaggaact caataaaqtg ctttgaaaat gctgagaa 2078 



<210> 4 

<211> 720 

<212> DNA 

<213> Homo sapiens 

<400> 4 

tcgdccgccc cttggcttct qcactgaizgg tgggtggatg agtaatgcat ccaggaagcc 60 
tggaggcctg tqqLLtccgc acccgctgcc acccccgccc ctagcgtgga catttatcct 120 
ctagcgctca ggccctgcgc gcatcgccgc agstccageg cccagagaga caccagagaa 180 
cccaccatgg ccccctttga gcccctggct tctggcatcc tgttgttgct gtggctgata 240 
gcccccagca gggcctgcac ctgtgtccca ccceacecac agacggcett ctgcaattcc 300 
gacctcgtca tcagggccaa gttcgtgggg acaccagaag tcaaccagac caccttatac 360 
cagcgttatg agatcsagat gaucaagatg tataaagggt tccaagcctt aggggatgcc 420 
gctgacatcc ggttcgtcta cacccccgcc atggagagtg tctgcggata cttccacagg 480 
tcccacaacc gcagcgagga gtttctcatt gctggaaaac tgcaggatgg actcttgcac 540 
atcactacct gc^gttttgt ggctccctgg aacagcctga gcttagctca gcgccggggc 600 
ttcaccaaga cetacactcjl tggctgtgeg gaatgcacag tgtttccctg ttcccactcc 660 
cat 1 L L cl t cr.qgnc.natg aaataaagag ttaccaccca gcagaaaaaa caaacaagtc 720 



<210> 5 

<211> 556 

<212> DNA 

<213> Homo sapiens 

<400> 5 

tcgaocgcoc cttggcttct gcactgatgg tgggtggatg agtaatgcat ccaggaagcc 60 

tggaggcctg tggtttccgc acccgctgcc acccccgccc ctagcgtgga catttatcct 120 

ctagcgctca ggccctgcgc gcatcgccgc agatccagcg cccagagaga caccagagaa 180 

cccaccatgg ccccctttqa gcccctggct tctggcatcc tgttgttgct gtggctgata 240 

gcccccagci gggcctgcac ctgtgtccca ccccacccac agacggcctt ctgcaattcc 300 

gacctcgtca tcagggccaa gttcgtgggg acaccagaag tcaaccagac caccttatac 360 

cagcgttatg agatcaagat gaccaagatg tataaagggt tccaagcctt aggggatgcc 420 

gctgacatcc ggttcgtcta cacccccgcc atggagagtg tctgcggata cttccacagg 480 

tcccacaacc gcagcgagga gtttctcatt gctggaaaac tgcaggtagt gatgtgcaag 540 
agtccatccg tcgtac 556 



<210> 6 
<211> 934 
<212> DNA 

<213> Homo sapiens 






<400> 6 

tcyaocgctjc cttggcttct gcactgatgg tgggtggatg agtaatgeat ccaggaagcc 60 
tgga^sTCctg tggtttccge acccgetgcc acccccgccc ctagcgtgga catttatcct 120 
ctagcgctca qgccctgcgc gcatcgecgc agatccagcg cccagagaga caccagagaa 180 
cccaccatgg ccccctttga qcccctggct tctggcatcc tgttgttgct gtggctgata 240 
gcccccagca gggcctgeac ctytgtccca ccccscccac agaeggcett ctgeaattcc 300 
gacctcgtca f.c^qqgccaa gtteqtgggg acaecagaag tcsaccagac eaccttatac 360 
cagcgttatg sgateaaqat gaccaagatg tataasgggt tccaagcctt aggggatgee 420 
gctgacatec ggttcgtcfca cacccccgcc atggagagtg tetgeggata cttccacagg 480 
tcccacaace ycagcqaggsi qtttctcatt tgctgggaaa attgeaggat gggatttttg 540 
cacattcact aacctgeaqt ttttgttygg ttccttggga acagectgag cttagctcag 600 
cgecygggot tcaccaagac ctacactgtt ggctgtgagg aatgcacagt gtttccctgt 660 
ttatcoatcc cetgcaaacc gcagagtggc actcattget tgtggacgga ccagctcctc 720 
caaggctcty aaaagqgctt ecagtcccgt caccttgcct gcctgcctcg ggagccaggg 780 
ctgtgcaoct ggoagtccct gcgytcecag atagectgaa tcctgcccgg agtggaagct 840 
gaagcutgea eagtgtccac ectgttccca ctcccatctt tcttceggac aatgaaafcaa 900 
agagttacca cccagcogaa aaaacaaaca agtc 934 



<210> 7 
<211> 783 
<212> DNA 

<213> Homo sapiens 
<400> 7 

tcgaccgccc cltggcttct gcactgatgg tgggtggatg agtaatgeat ccaggaagcc 60 
tggaggcctg tggtttccge acccgctgcc acccccgccc ctagcgtgga catttatcct 120 
ctagcgctca gqccctgcgc gcatcgcegc agatccagcg cccagagaga caccagagaa 180 
cccaccatgg ccccctttga qcccctggct tctggcatcc tgttgttgct gtggctgata 240 
gocccc^gca gggcctgcac ctgtgtccca ccccacccac agaeggcett ctgcaattcc 300 
gacctcgtca tcagggecaa gfctcgtqggg acaccagaag tcaaccagac eaccttatac 360 
cagcgttatg agateaagat gaceaagetg tat&aagggt tccaagcctt aggggatgee 420 
gctgacatcc ggttcgtcta cscccccgcc atggagagtg tetgeggata cttccacagg 480 
tcccacaace gcogeqagga gtttotcatt tatccatccc ctgcaaactg cagagtggca 540 
etcattgett gtggncgqac cagctcctcc aaggctctga aaagggcttc cagtcccgtc 600 
accttgcctq crct yectegy gagecaggge tgtgcacctg gcagtccctg cggtcccaga 660 
tagectgaat cetgcccqga gtggaagctg aagcctgcac agtgtccacc ctgttcccac 720 
tcecatcttt ctteeqgaca atgaaataaa gagttaccac ccagcagaaa aaacaaacaa 780 
gtc 783 



<210> 8 

<211> 895 

<212> DNA 

<213> Homo sapiens 

<400> 8 

tcgeccgece cttggcttct gcactgatgg tgggtggatg agtaatgeat ccaggaagcc 60 
tCjgaggcctg tqgtttccgc acccgctgcc acccccgccc ctagcgtgga catttatcct 120 
ctagcgctca ggccctgcqc gcatcgcegc agatccagcg cccagagaga caccagagaa 180 
cccocuatgg eccectttga gcccctggct tctggcatcc tgttgttgct gtggctg^ta 240 
gcecccayca qgycctgcsc ctgtgtccca ccccacccac agaeggcett ctgcaattcc 300 
gacctcgtca teagggecaa gttcgtgggg acaccagaag tcaaccagac eaccttatac 360 
cagcgttatg aqatraagat gaccaagatg tataaagggt tccaagcctt aggggatgee 420 
getgacatco ggttcgtcta cacccccgcc atggagagtg tetgeggata cttccacagg 480 
gctggaaaae tgcaggatyg actettgeae atcactacct gcagttttgt ggctccctgg 540 
aacagectga gettagetea gegcegggge ttcaccaaga cctacactgt tggctgtgag 600 
yaatgeacng tgtttccctg tttatccatc ccctgcaaac tgcagagtgg cactcattgc 660 
ttgtggacgg aceagctcct ccaaqgctct gaaaagggct tccagtcccg tcaccttgcc 720 
tgcctgcctc gggaqccagg gctgtgcacc tggcagtccc tgcggtccca gatagectga 780 






atcctgcccg gaqtgqanqc tgaagcctqc acagtgtcca ccctgttccc actcccatct 840 
ttcttccgga caatgaaata aagagttacc acccagcaga aaaaacaaac aagtc 895 



<210> 9 

<211> 2565 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ggcacgagca gccccggagc ccgggccagg gtccacctgt ccccgcagcg ccggctcgcg 60 

ccctcctgco gcagcuaccg agccgccgtc tagcgccccg acctcgccac catgsgagcc 120 

ctgctggcgc gcclgcttct. ctgcgtcctg gtcgtgagcg actccaaagg cagcaatgaa 180 

cttcatcaog ttocatcgaa ctgtgactgt ctaaatggag gaacatgtgt gtccaacaag 240 

fcacttctccn aca tt^actg gt.gcaactgc ccaaagaaat tcggagggca gcactgtyaa 300 

atagataagt CdaaaaccLg ctatgagggg aatggtcact tttaccgagg aaaggccagc 360 

actgacacc.ci tgqgocqgcc ctgcctgccc tggaactctg ccactgtcet tcagcaaacg 420 

taccatgccc acagatr.tga tgctcttcag ctgggcctgg ggaaacataa ttactgcagg 480 

gsggtggggg cacaaqgacc adaagcccfcc cctacagttc gcagaaacct tgttaccate 540 

cccttctccc agagggctgg ucatagcaca agagaagtgc agcctctggt tgagtcttcc 600 

ctgaggggng gaggcaggga aggccctctg ggttggaatg acatccccta tctttctgtg 660 

ttgcoaggaa uccaqacaac cggaggegac cctggtgcta tgtgcaggtg ggcctaaagc 720 

cgcttgtcc:a agagtgcatq gtgcatgact gcgcagatgg aaaaaagccc tcctctcctc 780 

cagaagaatt aaaatttcag tgtggccaaa agactctgag gccccgcttt aagattattg 840 

ggggagaatt cac caeca to gagaaccagc cctggtttgc ggccatctac aggaggcacc 900 

gggggggctc tgtcauctac gtgtgtggag gcagcctcat cagcccttge tgggt gates 960 

gcgecacaca ctgettcatt gattacccaa agaaggagga ctacatcgtc tacctgggtc 1020 

gctcaaggct taaetccaac aegcaagggg agatgaagtt tgaggtggaa aacctcatcc 1080 

tacacaagga ctacagcgct gaeaegcttg ctcaccacaa egaeattgee ttgetgaaga 1140 

tccgttccaa ggagggcagg tgtgcgcagc eateceggae tatacagacc atctgcctgc 1200 

cctcgetgta taacgatcne cagtttggca caagctgtga gatcactggc tttggaaaag 1260 

agaattctac cgactatctc tatceggage agetgaaaat gactgttgtg aagctgattt 1320 

cccaccggga gtgt.cagcag ccccactact aeggctctga agtcaccacc aaaatgctgt 1380 

gtgctgetga occacagtgg aaaacagat.t cctgccaggg agactcaggg ggacccctcg 1440 

tctgttccct ccaaggccgc atgactttga ctggaattgt gagctggggc cgtggatgtg 1500 

ccetgaagga caagccaggc gtctacacga gagtctcaca cttcttaccc tggatccgea 1560 

gtcacaccaa ggaagagaat ggcctqgccc tctgagggtc cccagggagg aaaegggcac 1620 

cacccgcttt cttgctggtt gtcatttttg cagtagagtc atctccatca gctgtaagaa 1680 

gagactggga agataggctc tgcanagatg gat-ctgectg tgccaccacc agggygaacg 1740 

acaa tagctt taccctcagg cataggectg ggtgctggct gcccagaccc ctctggccag 1800 

gatggagggg tqqtcctqnc tcaacstgtt actgaccagc aacttgtctt tttctggact 1860 

gaagectgea ggagttaaaa agggcagggc atctcctgtg catgggtgaa gggagageca 1920 

gctcccccga cggtggqcot ttgtgaggce catggttgag aaatgaataa tttcccaatt 1980 

yggaagtgta acagctgagg tctcttgagg gagcttagee aatgtgggag cagcggtttg 2040 

gggagcagag acactoeegs cttcagggca gggctctgat attccatgaa tgtatcagga 2100 

aatatatatg tgtgtgtatg tttgeacact tgtgtgtggg ctgtgagtgt aagtgtgagt 2160 

aagagctggt gtctgrittgt taagtfttaaa tatttcctta aactgtgtgg actgtgstgc 2220 

cacacagagt ggtctttctg gagaggttat aggtcactcc tggggectet tgggtcoccc 2280 

aegtgaedgt. gcctgggaat gtattattct gcagcatgac ctgtgaccag cactgtctca 2340 

gtttcacttt cycatagatg to.cctttctt ggccagttat cccttccttt tagectagtt 2400 

catccaatcc tcactgggtg gygtgaggac cactccttac actgaatatt tatatttcac 2460 

tatttttatt tatatttttg taattttaaa taaaagtgat caataaaatg tgatttttct 2520 
gatgamanaa ^aaaaagggc ggccgctcta gagtatccct cgagg 2565 



<210> 10 

<211> 208 

<212> DNA 

<213> Homo sapiens 



<400> 10 






ggggcacagc tgtgtttgge tgcagggcca agagcgctgt caageagacc oacacgcccc 60 
octceagcag ctgaattcct gnagctcagc agccgccgcc agagcaggac gaaccgccaa 120 
tcgcaaggca cctetgagaa ott^agg&tg cagatgtctc cagccctcac ctgcctagtc 180 
ctgggcctgg eeettgtctt tggtgaaggg tctyctgtgc accatccccc atcctacgtg 240 
gcccacctgg cctcagactt cggggtgagg gtgtttcagc aggtggcgca ggcctccaag 300 
gaccgcoacg tggttttctc accctatggg gtggcctcgg tgttggccat gctccagctg 360 
acaacaggag gagaaaccca ge-cigcagatt caagcagcta tgggattcaa gattgatgac 420 
aagggcatgq cccccqccet ccggcatctg tacaaggagc tcatggggcc atggaacaag 480 
gatgagatca gcacnacaga cgcgatcttc gtccagcggg atctgaagct ggtccagggc 540 
ttcatgcccc acticttcag gctgttccgg agcacggtca agcaagtgg© ctttfccagag 600 
gtggogagag ccagattcat catcaatgac tgggtgaaga cacacacaaa aggtatgatc 660 
agcaacttgc ttqggaaagy agccgtggac cagctgacac ggctggtgct ggtgaatgcc 720 
otctacttca acggccaqtg gaagactccc ttccccgact ccagcaccca ccgccgcctc 780 
ttccacaaat cagacggcag cactgtctct gtgcccatga tggctcegac qaacaagttc 840 
aactatactq /agttraccac gcccgatggc cattactacg acatcctgga actgccctac 900 
cacggggaca ccctcagcat gttcattgct gcagacctgg ttcccactga ggccctttgc 960 
aggatggaac tacgqggctt acaggagctt ttgtgtgcct ggtagaaact atttctgttc 1020 
eagtcacatt gccatcactc ttgtactgcc tgccaccgcg gaggaggctg gtgacaggcc 1080 
aaaggccagt ggaagaaata ccctttcatc tcagagtcca ctgtggcact ggccacccct 1140 
ccccagtaca gqggtqctgc aggtggcaga gtgaatgtcc cccatcatgt ggcccaactc 1200 
tcctggcctg gcoatctccc tccccagaaa cagtgtgcat gggttatttt ggagtgtagg 1260 
tgacttgttt actcattgae gcagatttct gcttcctttt atttttatag geatagagga 1320 
agaaaggtca gatgcgtqoc cagctcttca ccccccaatc tcttggtggg gaggggtgta 1380 
cctaaatatt tatcatatcc ttgcccttga gtgcttgtta gagagaaaga gaactactaa 1440 
ggadaataat attatitaaa ctcgctccta gtgtttcttt gtggtctgtg tcaccgtatc 1500 
tcaggaagtc cagccacttq nctggcacac acccctccgg acatccagcg tgacggagcc 1560 
cacactgoca cettgtqqcc gcctgagacc ctcgcgcccc ccgcgccccc cgcgcccctc 1620 
tttttcccct tgatgcjaa.it tgaccataca atttcatcct ccttcagggg atcaaaagga 1680 
cggagtggqg ggarrag^gac tcagatgagg gcagggtggt ttccaatgtg ttcaatagat 1740 
ttaggagcag aaatqcaagg ggctgcatga cctaccagga cagaacttte cccaattaca 1800 
gggtgactca naqccgcatt gqtgactcac ttcaatgtgt catttccggc tgctgtgtgt 1860 
gagcagtgga cacgigaggq gggggtgggt gagagagaca ggcagctcgg attcaactac 1920 
cttagataat atttcfgaaa acctaccagc cagagggtag ggcacaaaga tggatgtaat 1980 
gcactttqgg aggcc^aggc gggaggattg cttgagccca ggagttcaag accagcctgg 2040 
gcaacatace aagaccccog tctctttaaa satatatata ttttaaatat acttaaatat 2100 
aratttctaa tatctttaaa tatatatata tottttaaag accaatttat gggagaettg 2160 
cacacagatq tgaaatgaat gtaatctaat agaagcctaa tcagcccacc atgttctcca 2220 
ctgaaaaatc ctctttcttt gqggtttttc tttctttctt ttttgatttt gcactggacg 2280 
gtgacgtcaq ccatgtacag gatccacagg ggtggtgtca aatgctattg aaattgtgtt 2340 
gaattgtatg utttttcact tttgataaat aaacatgtaa aaatgtttca aaaaaataat 2400 
aaaataaata aatacgaaga atatgtcagg 



<210> 11 

<211> 2522 

<212> DNA 

<213> Homo sapiens 

<400> 11 

ggggcacagc tgtgtttggr tgcagggcca agggcgctgt caagaagacc cacacgcccc 60 
cctccaqcag ctgaattcct gcaqctcaqc agccgccgcu agagcaggac gaaccgccaa 120 
tcgcaaggca cctctgagaa cttcaqgatg cagatgtctc cagccctcac ctgcctagtc 180 
ctgqgcctqg cccttgtctt tqgtgaaggg tctgctgtgc accatccccc atcctacgtg 240 
gcccacctgg cctcaqactL cggggtgagg gtgtttcagc aggtggcgca ggcctccaag 300 
gaccgoaacg tggttttctc accctatggg gtggcctcgg tgttggccat gctccagctg 360 
acaacaggaq gagaaaccca gcagcagatt caagcagcta tgggattcaa gattgatgac 420 
aagqgcatqg cccccyccct ccggcatctg tacaaggagc tcatggggcc atggaacaag 480 
gatgagatca gcctccacfags cgcgatcttc gtccagcggg atctgaagct ggtccagggc 540 
ttcitgcccc acttcttcag gctgttccgg agcacggtca agcaagtgga cttttcagag 600 
gtggag3Cjd.g cx a gat 1. cat catcaatgac tgggtgaaga cacacacaaa aggtatgatc 660 
agcaacttqc ttqggaaagg agccgtggac cagctgacac ggctggtgct ggtgaatgcc 720 






ctctacttca acggccagtg qaagactccc ttccccgact ccsgcaccca ccgccgcctc 780 
ttccacaadt cagaeggcag cactgtctct gtgcccatga tggctcagac ca^caagttc 840 
aactatactg agttcaccac gcccgatggc cattactaeg acatcqtgga actgccctac 900 
cacggggaca ccctcagcat gttcattgct gccccttatg aaaaagaggt gcctctctcfc 960 
gcc.ct caeca acsttc+.geg tgcccsgctc atcagccact ggaaaggcaa catgaccagg 1020 
ctgccccgcc tcctqgttct gcecaagttc tccctggaga ctgaagtcga cctcaggaag 1080 
occctagaga acctgggdett gacegacatg ttcagacagt ttcaggctga cttcacgagt 1140 
ctttcaqacc aagagectet ceacgtcqcg caguegctge agaaagtgaa gatcgaggtg 1200 
aacgagagf. g qcacggi. ggc ctcctcat.ee acagctgtca tagtctcagc ccgcatggcc 1260 
cccgagg&ga teatcatgga cagacccttc ctctttgtgg tccctcccca yaaacagtgt 1320 
gcatygqtta ttttggagtg taggtgactt gtttactcat tgaagcagat ttctgcttcc 1380 
ttttattttt ataggaatag agg^agaaag gtcagatgcg tgeccagctc ttcacccccc 1440 
aatctcttgg tyqggagggg tqtacetaaa tatttatcat atccttgccc ttgagtgctt 1500 
gttagagaqa aagngaacta ctaaggaaaa taatattatt taaactcget cctagtgttt 1560 
ctttgtggtc tqtqtcnccg tatctcagga agtccageca cttgactggc acacacccct 1620 
ecggacatcc agogtgaegg agcccacact gccaccttgt ggccgcctga gaccctcgcg 1680 
ccccccgcgc cccccgcgcu cctctttttc cccttgatgg aaattgacca tacaatttca 1740 
tcctccttca ggggatcaaa aggaeggagt ggggggacag agactcagat gaggacsgag 1800 
tggtttccaa tgtgttcaat agatttagga geagaaatge aaggggctgc atgacctacc 1860 
aggacagaac tttccccaat tacagggtga ctcacagccg cattggtgac tcacttcaat 1920 
gtgtcatttc cqqctgctgt qtgtgagcag tggacacgtg aggggggggt gggtgagaga 1980 
gacaggcagc teggattcaa ctaccttags taatatttct gaaaacctac cagecagagg 2040 
gtagg^caca aaqatggatg taatgeaett tgggaggeca aggegggagg attgettgag 2100 
cccaggagtt caagaccagc ctgggcaaca taccaagacc eccgtctctt taaaaatata 2160 
tatattttaa atatacttaa atatatattt ctaatatctt taaatatata tatatatttt 2220 
aaagaecaat ttatqggaga attgeacaca gatgtgaaat gaatgtaatc taatagaagc 2280 
ctaatcagce caccatgttc tccactgaaa aatcctcttt ctttggggtt tttctttctt 2340 
tcttttttga ttttqcactg gacggtgacg teagecatgt acaggatcca caggggtggt 2400 
gtcaaatget attgaaattg tgttgaattg tatgettttt cacttttgat aaataaacat 2460 
gtaaaaatgt ttcasaaaaa taataaaata aataaatacg eagaatatgt caggacaaaa 2520 
a<3 2522 



<210> 12 
<211> 2272 
<212> DNA 

<213> Homo sapiens 
<400> 12 

ctcttccccg ctgagoggsa cagccastgc gwctccwccc tccagctcga cggcagccgc 60 
ccggccgaca gccccqagac gacagcccgg cgcgtcccgg tccccacctc cgaccaccgc 120 
era ycgctnr.a ggcccegeyc tccccyctcg eugccaccgc gccctccgct ccgcccgcag 180 
tgccaaccat gaccgccgcc agtatgggcc ccgtccgcgt cgccttcgtg gtcctcctcg 240 
ccctctgcag ccggccggcc gteggecaga actgeagegg gccgtgccgg tgcccggacg 300 
agccggcgcc gcgctgcceg gcgggcgtga gcctcgtgct ggacggctgc ggctgctgcc 360 
gcgtotgeqc enngcagctg ggcgagctgt gcaccgagcg cgacccctgc gacccgcaca 420 
agggectett etgtgacttc ggctcoccgg ccaaccgcaa gateggegtg tgcaccgcca 480 
aagatggtgc tccctgcatc ttcggtggta cggtgtaccg cageggagag tccttccaga 540 
geagctgeaa gtgccagtgc acgtgcctgg aeggggeggt gggctgcatg cccctgtgca 600 
gcatggacgt tcgtctgccc agcccLgact gccccttacc gactggaaga cacgtttggc 660 
ccagacccaa ctatg^ttag egccaactgc ctggtccaga ccacagagtg gagegectgt 720 
tccaagacct qtgggatggg catctccacc egggttacca atgacaaege ctcctgcagg 780 
etagaqaagc- a gage**, eject qtgcatggtc aggccttgcg aagtgacctg gaagagaaca 840 
ttaagaaggg caaaaagtgc atceytactc ecaaaatctc caagcctatc aagtttgagc 900 
tttctggctg caccaqcatg aagscatacc gagctaaatt ctgtggagta tgtaccgacg 960 
gecgatqetg caccccccac aqaaccacca ccctgccggt ggagttcaag tgccctgacg 1020 
gcgaggtcat gaagaaqaac atgatgttca tcaagacctg tgcctgccat tacaactgtc 1080 
ocggagacaa fgacatcttt gaatcgctgt actacaggaa gatgtacgga gacatggcat 1140 
gaayecagag aqtgngaysc attaactcat tagactggaa cttgaactga ttcacatctc 1200 
atttttccqt aaaaatgatt tcagtageac aagttattta aatctgtttt tctaactggg 1260 
ggariaaqatt cccacccaat tcanaacatt gtgccatgtc aaacaaatag tctatcaacc 1320 



ccagacactg y L XX.gnn.qa a tqttaagset fcgaeagtgga actacattag tacacagcac 1380 
cagaatqtal: at ta^ggtgt qyctttagga gcagtgggag ggtaccagca gaaaggttag 1440 
tatoatcaga tagca Lctta tacgagtaat atgectgeta tttgaagtgt aattgagaag 1500 
gaaaatbLta gcgrgctcac Lgaectgcct gtagccccag tgacagctag gatgtgcatt 1560 
ctccagccat caagagactg agtcaaqttg ttccttaagt cagaacages gactcagctc 1620 
tgacattctg at tegs at. ga cactgttcag gaatcggaat cctgtcgatt agactggaca 1680 
gcttgtqqca agtgaatttq cctgtaacaa gecagatttt ttaaaattta tattgtaaat 1740 
attgtgtgtg fcqtglgtgtg tgtatatata tatatatgta cagttatcta agttaattta 1800 
aagttgtttg rgccttttta t Itttgtttt- taatgctttg atatttcaat gttagectea 1860 
atttctgaac acca liaggta gaatgtaaaq eLLgtctgat cgttcaaagc atgaaatgga 1920 
tacttataty gaaattctgc tcagatagaa tgacagtccg tcaaaacaga fctgtttgcaa 1980 
aggggaggca tc-aqL gtnct tggcaggctg atttctaggt aggaaatgtg gtagcctcac 2040 
ttttaatqaa caaatggcct Uiattaaaaa ctgagtgact ctatatagct gatcagtttt 2100 
ttcacotgga agcatttytt totartttga tatgactgtt tttcggacag tttatttgtt 2160 
gagagtgtga ccaaa.igtta catgtttgca cctttctagt tgaaaataaa ytgtatattt 2220 
1 1: t ct a t n n a aa a t q t eg a C c ggee g ct aa tttaqtagta gtaggeggea gc 2272 



<210> 13 
<211> 1153 
<212> DNA 

<213> Homo sapiens 
<400> 13 

tgaaatatyg agaatagtct tagatgtgtt tgggtaccca agctggcttt tgtactcttc 60 
ggagcttcct tqclcagcgc gcatcttcaa gtaaccggtt ttcaaattaa agctttcaca 120 
qcaclycgct tcctctcaqa accttctgat gccgtcacaa tguggggagg aaatgtcctc 180 
ctcgactgci ccgcggagtc cgaecgagga gttccagtga tcaagtggaa gaaagatgea 240 
i L c a t c t g gc c 1 1 qqga a y ga t g a a a gga agcagcaact tbcaaatggg tetctgetga 300 
tacaaaacat acttcattcc agacaccaca agecagatga gggacrttac caatgtgagg 360 
catctttaqq ayattctggc tcaattatta gteggacage aaaagttgca gtagcaggac 420 
cactgaggtt cctttcacao arragaatctg tcacagcctt catgggagac acagtgetae 480 
tcaagtgtga aqtcattggg gayeecatgc caacsatcca ctggcagaag aaccaacaag 540 
acctgaclcr riatcccaqqt gactcccgag tggtggtctt gccchctgga geattgeaga 600 
tcagccgact ccaaccgggg gacattggaa tttaccgatg ctcagctcga aatccagcca 660 
gctcoagaac aggaaaLgaa gcagaagtca gaattttatc agatccagga ctgeatagae 720 
agctgtattt ictgeaaaga ccatccaatg tagtagecat tgaaggaaaa gatgetgtec 780 
tygaatgttg tgtttctqqo tatoctccac caagttttac ctggttacga ggegaggaag 840 
tcatccaact caqgtctaaa aagtattctt tattgggtgg aagcaacttg cttatctcca 900 
atqtgacaga tgatgacagL ggaatgtata cctgtgttgt cacatataaa aatgagaata 960 
ttagtgcctc tqcagagcto acagtcttqq taatcatcga taaagtattg gtggacacct 1020 
tttgggtata nggattqcao taggttctgg gaagactaca gaaacatctg aaacctggct 1080 
tcagtgggct i. cgta taatt acaattegta aaatcagttg gecwaatact gtctgaataa 1140 
a c r. 1 1 r. w w 1 1 a a a 1153 



<210> 14 
<211> 4459 
<212> DNA 

<213> Homo sapiens 
<400> 14 

tgaaatatgej agdritagtct tagatgtgtt tgggtaccca agctggcttt tgtactcttc 60 
ggagcttcct iqctcaqcgc gcatcttcaa gtaaueggtt ttcaaattaa agctttcaca 120 
gcactgcgct tcctctcaga accttctgat gccgtcacaa tgeggggagg aaatqtcctc 180 
ctcgactgct ccqcggagtc cgaccgagga gtrccagtga tcaagtggaa gaaagatgea 240 
ttcatctqqc flirggga^tg gatgaaagga agcagcaact ttcaaatggg tetctgetga 300 
tacaaaacat actteattec agacaccaca agecagatga gggactttac caatgtgagg 360 
catctttagg agottctggc tcaattatta gteggacage aaaagttgca gtagcaggac 420 
cactgaggtt ccttlcacag acagaatctg tcacagcctt catgggagac acagtgetae 480 
Lcaagtgtga aqtcattggg gagcccatgc caacaatcca ctggcagaag aaccaacaag 540 
^octgactcc oatcrcaggt gactcccgag tggtggtctt gccctctgga gcattgcaga 600 
tcaqccgact coiaccgggg gacattggaa tttaccgatg ctcagctcga aatccagcca 660 
gctcaagnae .igqanatgaa gcagaagtca gaattttatc agatccagga ctgcatagac 720 
agctgtattf totqcanaqa ccatccaatg fcagtagccat tgaaggaaaa gatgctgtcc 780 
tggaatgt-tcj tgtttetggc tatcctccac caagttttac ctggttacga ggcgaggaag 840 
tcatccaart cagqtctaaa aagtattctt tattgggtgg aagcaacttg cttatcfccca 900 
atgtqacaqa I gat gaca gt ggaatgtata cctgtgttgt cacatataaa aatgagaata 960 
ttagtgcctc tgragagetc acagtcttgg ttccyccatg gtttttaaat catccttcca 1020 
acctgtatqc cLatgaaage atggatatfcg agtttgaatg t^cagtctct ggaaagcctg 1080 
tgcccactgt gaattggatg aagaatggag atgtggtcat tcctagtgat tattttcaga 1140 
tagtgggagg aagcnactta cggat^cttg gggtggtgaa gtcagatgaa ggcttttatc 1200 
aatgtgtgge tgaaaatqag gctggaaatg cccagaccag tgcacagctc attgtcccta 1260 
agcctgcaat cccaaqctcc agtqtcctcc cttcggctcc cagagatgtg gtccctgtet 1320 
tggtttncag ccqatttgtc cgtctcagct ggcgcccscc tgcagaagcg aaagggaaca 1380 
ttcaaacttt cncqgtcttt ttctccagag aaggtgacaa cagggaacga gcattgaata 1440 
caacacagcc tggglccctt cogctcactg tgggaaacct gaagccagaa gccatgtaca 1500 
cctttcqagt tgtgqcttac aatgaatggg gaccgggaga gagttctcaa cccatcaagg 1560 
tggccacaca gcctgagttg caagttccaq ggccagtaga aaacchgcaa gctgtatcta 1620 
cotcac^tac ctcaattctt attacctggg aaccccctgc ctatgcaaac ggtccagtcc 1680 
aaggttacag attgttctgc actgaggtgt ccacaggaaa agaacagaat atagaggttg 1740 
atggacrtatc ttatnaactg gaaggectga aaasattcac cgaatatagt cttcgattct 1800 
tagcttataa tr.gc.:tatggt ccgqgcgtct ctactgatga fcataacagtg gt^racacttt 1860 
ctqaogtqcc aaqtgccccg cctcagaacg tctccctgga agtggtcaat tcaagaagta 1920 
tcaaagttag ctggetqcct cctrcatcag g^acacaaaa tggatttatt accggctata 1980 
aaattcgaca cagaaagaeg aoecgnaggg gtgagatgga aacactggag ccaaacaacc 2040 
tctggtacct attcacaggo ctggagaaag gaagtcagta cagtttccag gtgtcagcca 2100 
tgacagtc&a tggtactgqa ccaccttcca actggtatac tgcagagact ccagagaatg 2160 
atctagstga atcteaagtt cetgatcaac caagctctct tcatgtgagg qcccagacta 2220 
actgcatcat catqagttqg actectccct tgaacccaaa catcgtggtg cgaggttata 2280 
ttatcggtta tggogttggg aqcccttacg ctgagacagt gcgtgtggac agcaagcagc 2340 
gatattatte cattgagaqg ttagagteaa gttcccatta tgtaatctcc ctaaaagctt 2400 
ttaacaatgc cggagaagga gttcctcttt atgaaagtgc caccaccagg tctataaccg 2460 
atoccactga cocaqttgat tattatrctt tgcttgatga tttcrccacc tcggtcccag 2520 
ytct-Ctccar: cccc.ri i.qctc. ccaccagtag gtgtacaggc tgtggctctt acccatgatg 2580 
ctgugagggt cagctgggca gacaactctq tccctaagaa ccaaaagacg tctgaggtgc 2640 
gactttacac cytccggtgg ageacragct tttctgcaag tgcaaaatao a^gtcagaag 2700 
acacaacatc tctaagttac acageaacag gcctcaaacc aaacacaatg tatgaattct 2760 
cggtcatggt aacaaaaaac agaaggfccca gtacttggag catgactgca catgccacca 2820 
cgtatgaagc agoccccaee tctgctccea aggactttac agtcattsct agggaaggga 2880 
agcctcgtqc cgtcattgtg agttggcagc clcccttgga agccaatggg aaa&ttactg 2940 
ettocatctt attttatacc ttggacaagg acatcccaat tgatgactgg attatggaaa 3000 
eaatcagtgg tgataggctt acfccatcaaa tcatggatct caaccttgat actatgtatt 3060 
actttcgaat tcaagcacga aattcaaaag gagtggggcc actctctgat cccatcctct 3120 
tcagqactct gaaagtggaa caccctgaca aaatggctaa tgaccaaggt cgtcatggag 3180 
atggagqttvi ttggccagtt qatacUaatt tgattgatag aagcacccta aatgagccgc 3240 
caattggaca aatgcacccc ccgcatggca gtgtcactcc tcagaagaac agcaacctgc 3300 
ttgtgntcat tgtggtcacc gttggtgtca l:cacagtgct ggtagtggtc atcgtggctg 3360 
tgatttgcac ccgauqctct tcagcccagc agagaaagaa acgggccacc cacagtgctg 3420 
gcaaaagqaa ggqcagccag aaggacctcc gaccccctga tctttggatc catcatgaag 3480 
aaatggaqat gaaaaatatt gaaaayccat ctggcactga ccctgcagga agggactctc 3540 
ccatccaaog ttgccaagec ctcacaccag tcagccacag ccagtcagaa acccaactgg 3600 
gnaqcaaaag r.acctcl cat tcsggtcaag acactgaggs agcagggagc tctatgtcca 3660 
cLctggcjgag ytcgctggct gcacgrcgag ccccccgggc caagctcatg attcccatgg 3720 
atgcccagtc cafioaatcxt gctgtcgtga gcgcc^Lccc ggtgccaacg ctagaaagtg 3780 
cccagtaccc nggaatcctc ccytctccca cctgtggata tccccacccg cagttcactc 3840 
tocggcctgt gccattccca acactetcag tggaccgagg tttcggagca ggaagaagtc 3900 
agtcaqtgaq tg^aqganca actacccaac aaccacctat gctgccccca tctcagcctg 3960 
agcattctaq rogcgaggag gcsccaagca gaaccatccc cacagcttgt gttcgaccaa 4020 
ctbacrcact ocgcagcttt gctaatcctt tgctacctcc accaatgagt gcaatagaac 4080 
cyaaaqtncc ttticacacca cttttgtcrc agccagggcc cactcttcct aagacccatg 4140 
tgaaaacagc ctcecttgqg t.fcqgctggaa aagcaagstc ccctttgctt cctgtgtctg 4200 
tgcuaacaqc rrcctgaagtg tctgaqgaga gccacaaacc aacagaggat tcagccaatg 4260 
taagtgcntc tttaaaattc atgctteatc aagggacaga ttgactggcg ctgtgtaata 4320 
gcatctriaac craatgagtac tgtccattca ccccaggcca ccccsggcca ccccagtcca 4380 
ttgttgttct tattqttgtt tgttcctttg ttttcctgca cttaaatatg aagaggaaat 4440 
ta35tctaty atattaaaa 4459 



<210> 15 
<211> 3744 
<212> DNA 

<213> Homo sapiens 
<400> 15 

ccgaytgaca aggaggt.ggg agagggtagc 
tccc^ctgct qctgaagct.g ccctqcccat 
ccagggctag gagtgggcct gccqttcatg 
agagaggctt gqgctggtgg i;gggcccggc 
gccectggat gccttcctcc ctgqgaaagc 
gcataqatgg ggncatcaria ctgggaggcc 
gcaagccctg t'jgagaactt nayaaggaaa 
tcgccctgga tcgcatcanc aacgacccgg 
gcattctgga cacctgetcc agggacaccc 
aggcgctcat cgagaaggat ggcacagagg 
tcaccaagrc tgancqtgtg gtgggtgtco 
tggtggccaa catcctt-.cgc ctcttcaaga 
cagacctgag tgacaacagc cgctacgact 
aucaggccca ggccatggtq gacatcgtcc 
tggcctcgga gqgcagetat ggtgagagcg 
aggacggggg cgtgtqcate qcccaytcgg 
agttcgacas gatcatccgc cgcctcctqg 
ttgccaacga ggatgacatc aggcgtgtgc 
qccatttctt ctggatgggc totqacagct 
tgg&ggaggt ggctgagggt gctgteacga 
gtgagcga.Tt tgggoaggat tcagottatg 
ntgccgtgta cgccatggge cacgcgctgc 
ycgtggqgct ctqccngcqc atggaccctg 
gaaacgtcaa ctt c tc.iggo atcgeaggga 
cgcctggqcg ctatgacatc fcaceaatacc 
toattggcic ctggactgac cacc.tgcacc 
gcgggeagca gr.Ttgcccegc tccatctgca 
agacagtgaa gqgcatycct tgctgctggc 
aggtggaccg ctaeacctgt aagacgtgtc 
eqggctqccg geccatcccc atcatcaagc 
ccrj-cttcot qgcogtggtg ggcatcgctg 
gctacaacga cacqcccatc gtcaaggcct 
caggc-atctt cctgtgctat gccacaacct 
cctgctcgct gcgecgaatc. ttcctgggac 
tcaccaagac caaccgcatc: taccgcatct 
nacgctt.cat cayccccgcc tc.acagctgg 
tgctgggcat ctgf.gtqtgg tfctgtggtgg 
accagcggac actcgacccc cgcttcgcca 
tgtcgctcat; ctgcetqctg ygctacagca 
ccatcaagac acgcqqcgtg cccg^gacct 
tqtacaceac ttqcatcgtc tggctggcct 
uggccgacaa gctgtacatc cagacgacga 
cggt.gtccct yggaatgctc tacatgccca 
ayaacgtgcc ca.agcgo.dag cgcagcctca 
picaagtteac geagaaggqc aacttccggc 
agaaccttyja ggccccagcg crggccacca 
caatctagcq agtccatgga gotgaqcagc 
gcgtcgggcc rtyggGcacac ccaagggcec 



agcatgggct acgcggttgg ctgccctcag 60 
goccacccag gccgtggggc csggggcctg 120 
gytctetagg gatttccgar atgcctggga ISO 
tgcccctttg cctgctcctc agcctttacg 240 
ccaaaggcca ccctcacatg aattccatcc 300 
tgttcccggt gcatggccgg ggctcagagg 360 
agggcatcca ccggctggag gccatgctgt 420 
acctgctgcc taaeatcacg ctgggcgccc 430 
atgccctcga gcagtcgctg acctttgtgc 54 0 
tccgctgtqg cagtggcggc ccacccatca 600 
tcggtgcttc agggagctcg gtctccatca 660 
taccccagat cagctacgcc tccacagcgc 720 
tcttctcccg cgtggtgccc fccggecacgt 780 
gtgccctcaa gtggaactat gtgtccacag 840 
gtgtggaggc cttcatccag aagtcccgtg 900 
tgaagatacc acgggagccc aaggcaggcg 960 
agacttcgaa cgccagggca gtcatcatct 1020 
tggaggcagc acgaagggcc aaccagacag 1080 
ggggctccaa gattgcacct gtgctgcacc 1140 
tcctccccao gaggatgtcc gtacgagacc 1200 
acjcaggaggg gaaggtgcag thtgtgatcg 1260 
acgccatgoa ccgtg^cctg tgtcccggcc 132Q 
tagatggcac ccagctgctt aagtacatcc 1380 
accctqtgac cttcaatgag aatggagatg 1440 
agctgcgcaa cgattctgcc gagtacaagg 1500 
ttagaataga gcggatgcac tggccgggga 1560 
gcctgccctg ccaaccgggt gagcggaaga 1620 
ectgcgagcc ttgcacaggg taccagtacc 1680 
cctatgacat gcggccqaca gagaaccgca 17 40 
ttgagtgggg ctcgccctgg gccgtgctgc IS 00 
ccacgttgtt cgtggtgatc acctttgtgc 1860 
cgggccgtga actgagctac gtgctgctgg 1920 
tcctcatgat cgctgagccc gaccttggca 198 0 
tagggatgag catcagctat gcagccctgc 2040 
tcgagcaggg caagcgctcg gtcagtgccc 2100 
ccatcacctt cagcctcatc tcgctgcagc 2160 
acccctccca ctcggtggtg gacttccagg 2220 
ggggtgtgct caagtgtgac atctcggacc 2280 
t get get cat ggtcacgtgc accgtgtatg 234 0 
tcaatgaggc caagcccatt ggcttcacca 2400 
toatccccat cttctttgqc acctcgcagt 24 60 
cgctgacggt ctcggtgagt ctgagcgcct 2520 
aagtctacat catcctcttc cacccggagc 2530 
aagccgtcgt tauggeggee accatgtcca 2640 
coaaeggaga ggecaagtet gagctctgeg 2700 
aacagactta cgtcacttac accaaccatg 2760 
aggaqgagga grcgtgaccc tgtggaaggt 2820
agctgtcttq cctgcccqtg ggcacccacg 2880 






gacgtggctt gqtgeujagg atagcagagc ccrccagccat cactgctggc agcctgggca 2940 
aaccgggtga gcaacaggag gacgaggggc cggggcggtg ccaggcta.cc acaagaacct 3000 
gcgtcttgga ccalztgcccc toccggcc.cc aaaccacagg ggctcaggtc gtgtgggccc 3060 
eagtgctaga tctctccctc ccttcgtctc tqtctgtgct gttggcgacc cctctgtctg 3120 
tctccagccc tgtctttctg ttctcttatc tctttgtttc accttttccc tctctggcgt 3180
ccccggctgc ttgtactctt ggccttttct gtgtctcctt tctggctctt gcctccgcct 3240 
ctctctctca tcctctttgt cctcagctcc tcctgctttc ttgggtccca ccagtgtcac 3300 
ttttctgccg ttttctttcc tgttctcctc tgcttcsttc tcgtccagcc attgctcccc 3360 
tctccctgcc acceitcccc agttcaccaa accttacatg ttgcaaaaga gaaaaaagga 3420
aaaaaaatca aaacacaaaa aagceaaaac qa4.aac6d.at ctcgagtgtg ttgccaagtg 3480 
ctgcqtccte ctggtggcct ctgtgtgtgt ccctgcggcc cgeagcctgc Ccgcctgccc 3540 
cgcccatctg ccgtgtgtct tgcccgcctg ccccgcccgt ctgccgtotg tcttgcccgc 3600 
ctgcccgcct geocctcctg ccgaccacac ggagttcagt gcctgggtgt ttggtgatgg 3660 
ttattgacga caatgtgtag egcatgattg tttttatacc aagaacattt ctaataaaaa 3720 
taaacacatg qttttgcaaa aaaa 3744



<210> 16 

<211> 2899

<212> DNA 

<213> Homo sapiens

<400> 16 

cgcgggagcc aacttcaggc tgctcagagg aagcccgtgc agtcagtcac ctgggtgcaa 60 
gagcgttqct grctcgggcr cteccqctgc ayggagagcg gcactcgctg gcctggatgt 120 
ggttggattt aggggggctc cgcagcaggg gtttcgtggc ggtggcaagc gctgcaacag 180 
gtagacggcg agagaeggac cccggccgag gcagggatgg agsccaaagg ctaceacagt 240 
cLccctgaag gtctagatat ggaaagacgg tggggtcaag tttctcaggc tgtggagcgt 300 
tcttccctgg gacutacaga gaggaocgat gagaataact acatgyagat tgtcaacgta 360 
agctgtgttt ccggtgctat tccaaacaac agtactcaag gaagcagcaa agaaaaacaa 420 
gaactactcc cttgccttea gcaagacaat aatcggcctg ggattttaac atctgatatt 480 
aaaactgagc tggaatctaa gqaactltca gcaactgtag ctgagtccat gggtttatat 540 
atggattctg taagagatgc tgactattcc tatgagcagc agaaccaaca aggaagcatg 600 
agtccagcta agatttatca gaatgttgan cagctggtga aattttacaa aggaaatggc 660 
catcgtcctt ccactctaag ttglqlgaac acgcccttga gatcatttat gtctgactct 720 
gggagctccg tqaatgytgg cgtcatgcgc gccattgtta aaagccctat catgtgtcat 780 
gagaaaagec cgtctgtttg cagccctctg aacatgacat cttcggtttg cagccctgct 840 
ggaatcaact ctqtgtcctc caccacagcc agctttggca gttttccagt gcacagccca 900 
atcacccaqg gaactcctct gacatgctcc cctaatgctg aaaatcgagg ctccaggtcg 960 
cacagccctg cecatgctag caatgtgggc tctcctctct caagtccgtt aagtagcatg 1020 
aaaccctcaa tttccagccc tccaagtcac tgcagtgtaa aatctccagt ctccagtccc 1080 
aataatgtca ctctgagatc ctctgtgtct agccctgcaa stafctaacaa cfccaaggtge 1140 
tctgtttcca gcccttcgaa eactaataac agatccacgc tttccagtcc ggcagccagt 1200 
actgtgggat clatctgtag ccctgtaaac aatgccttca gctacactgc ttctggcacc 1260 
tctgctggat ccagtacatt ycgggatgtg gttcccagtc cagacacgca ggagaaaggt 1320 
gctcaagagg tcccttttcc taagactgag gaagtagaga gtgccatctc aaatggtgtg 1380
acLggccagc ttaatattgt cuagtacata aaaccagaac cagatggagc ttttagcagc 1440 
tcatgtctag gaggaaatag caaaataaat tcggattctt cattcteagt accaataaag 1500 
caagaatcaa ccaagcattc atgttcaygc acctctttta aagggaatcc aacagtaaac 1560 
ccgtttccat LLatggatgg ctcgtatttt tcctttatgg atgataaaga ctattattcc 1620 
Ctatcaggaa ttttaqga.ee acctgtgccc ggctttgatg gtaactgtga aggcagegga 1680 
ttcccagtgg gtattaaaca agaaceaga.t gaegggaget attacccaga ggccagcatc 1740
ccttcctctg ctattgttgg gqtqaattca ggtggacegt ccttccacta caggattggt 1800
gctcaaggta caaLatcttt atcacgatcg gctagagacc aatctttcca acacctgagt 1860 
tcctttcctc ctgtcaatsc Lttagtggag tcatggaaat cacacggcga cctgtcgtct 1920 
agaaqaaqtq atgggtatcc ggtcttagaa tacattccag aaaatgtatc aagctctact 1980 
ttacgaagtq tttctdcigg atcttcaaga ccttcaaaaa tatgtttggt gtgtggggat 2040
gaggrttnag gatqecatta tggggtagtc acctgtggca getgeaaagt tttgttcaaa 2100 
aqagcagtgg aagggcaaca naactattta tgtgctggaa gsaatgattg catcattgat 2160 
aagattcgac gayygaattg tcctgcttgc agacttcaga aatgtcttca agctggaang 2220 
aatttaggag cacgaaagtc aaagaagttg ggsaagttaa aagggattca cgaggagcag 2280 






ccanageagc agt_-agccccc accccuaccc 
acgtacatog ctcctgcaaa agaaccctcg 
acaatctcac gaqcgctcac accttccccc 
attqtatalq caggcLatga cayctcaaaa 
ctcaacogct tagoaqgoria acagatqatc 
ggatttaaaa acttgcctct tgaggaccaa 
ctateateat ttqccttgag ctggagatcg 
tttgcaccay acctagLctt taatgaactg 
agtggcactt cacatgcctc tgtggattct 
ocaccca^ac c:otettgcgt ctgaagcaot 
tacacaactc aqttgca aa 



ccacccccgc aaageccaga ggaagggaca 234 0 

gtcaacacag cactggttcc tcagctctcc 2400 

gttatggtcc ttgaaaacat tgaacctgaa 2460 

ccayatacag ccgaaaatct gctctcc^cg 2520 

eaagtcgtga agtgggcaaa ggtacttcca 2580

attaccctaa tccagtattc ttggatgtgt 2640 

tacaaacata cgaacagcca atttctctat 2700 

ttggcgagag tgagagaggg ttgaagtctt 2760 

ctgagtgtct ctctgggatg aaggtgactt 2820

agttgtgtga gggcagctcc attagacgag 2880 

2899



<210> 17
<211> 2780
<212> DNA

<213> Homo sapiens
<400> 17

egoyggagee aacttcagge tgctcagayg 
gagegttget gectogggot ctcccgctgc 
ggttggefctt aggggggctc cgcagcaggg 
gtagacggcg agagaeggae cccggccgag 
ctccctgaog gtctagatat ggaaagaegg 
tcttccctgg gacctacaqa gaggacegat 
agctgtgttt ooqqtgctat tccaaacaac 
gaactaetcc cttgccLtea gca?gacaat 
aaaactgagc Ujqaatctaa ggaactttca 
af.ggattctq taagagatqc tgactattcc 
agtcoagcta ,igoL ttatca gaatgtfcgaa 
catcgtcctt ecactctriag ttgtgtgaac 
gggagctccg tgna Uqqtgg cgtcatgcgc 
qagaaaagee cqtctgtttg cagccctctg 
ggaatcaact r:tgtgtcctc Criccaoagcc 
atcacccagq gaactcctct gaeatgetee 
cacayccctg cacatgetag caatgtgggc 
aaatcctcaa tttccagccc tccaagtcac 
aataatgtca ctctgagetc ctctgtgtct 
tetqtttcca gcccttcgaa cactaataac 
actcrtgggat ctatctgtag ccctgtaaac 
tetgetqgat ccogtacatt gcgggatgtg 
yctcaagagq tccctt.ttcc taagactgag 
actggccagc ttaatattgt ccagtacata 
tcatgtctag gaggaaatag caaaataaat 
caagaatcaa ccaagcattc: atgttcaggc 
ccgtttccat ttatggatgg otcgtatttt 
ctatcaggao ttttaggacc acetgtgccc 
ttcccagtgg qtattnaaea agaaccagat 
ccttcctctg c.tattgttgg ggtgaattca 
qctcaaggta caatatcttt aLcacgatcg 
tccttteetc ctgt.^aatac tttagtggag 
agaagaagtg atqqgtatcc ggtcttagaa 
ttacgaagtg tttctactyg atcttcaaga 
gaggcttcaq gn tgc.:catta tggggl.agtc 
agaqcagtgg aogqqvaaca csactattta 
aagettcgao gyaagaattg U: r .:tgcttge 
aatttaggag cacgaaagtc saagaagttg 
ccacagcaqc agnagccccc acccccaccc 
aegtacateg ctcctgcaaa agaaccctcg 
aoastctcac gagcqctcac accttccccc 
attgtatatq caggctatqa cagct ra a aa 



aagcccgtgc agtcagtcac ctgggtgcaa 60 
aqggagagcg gcactcgctg gcctggatgt 120 
gtttcgtggc ggtggcaagc getgeaacag 180 
gcagggatgg agaccaaagg ctaccacagt 240
tggggtcaag tttctcaggc tgtggagcgt 300 
gagaataact acatggagat tgtcaacgta 360 
aytactcaag qaagcagcaa agaaaaacaa 420
aatcggcctg ggattttaac atctgatatt 480
gcaactgtag ctgagtccat gggtttatat 540 
tatgagcagc agaaccaaca aggaagcatg 600 
cagctggtga aattttacaa aggaaatggc 660
acgcccttga gateatttat gtctgactct 720 
gccettgtta aaagocctat catgtgtcat 780 
aacatgacat cttcggtttg cagccctgct 840
agctttggca gttttccagt gcacagccca 900 
ectaatgetg aaaatcgagg ctccaggtcg 960 
tctcctctct caagtcegtt aagtagcatg 1020 
tgcagtgtaa aatctccagt ctcgagtccc 1080 
agccctgcaa atattaacaa ctcaaggtgc 1140 
agatucaege tttccagtcc ggcagccagt 1200 
aetgecttea gctacactgc ttctggcacc 1260 
gttcccagtc cogacacgca ggagaaaggt 1320 
gaagtagaga gtgccatctc aaatggtgtg 1380 
aaaccagaac cagatggagc ttttagcagc 1440 
teggattett cattctcagt accaataaag 1500 
acctctttta aagggaatcc aacagtaaac 1560
tcctttatgg atgataaaga ctattattcc 1620 
ggctttg^tg qtaactgtga aggcagegga 1680 
gaegggaget attacccaga ggccagcatc 1740
ggtygacagt ccttccacta caggattggt 1800 
qctagagacc aatctttcca acacctgagt 1860
tcatggaaat cacacggcga cctgtcgtct 1920 
tacattccag aaaatgtatc aagctctact 1980 
ccttcaaaaa tatgtttggt gtgtggggat 2040
acctytggca getgeaaagt tttcttcaaa 2100 
tgtgetggaa gaaatgattg catcattgat 2160 
agacttcaga aatgtctfeca agctggaatg 2220 
ggaaagttaa aagggattca cgaggagcag 2280
ccdccfjccgc aaageccaga ggaagggaca 2340
gtcaacacag cactggttcc tcagctctcc 2400 
gttatggtcc ttgaaaacat tgaacctgaa 2460
ccagatacag ccgaaaatct gctctqcacg 2520






ctcaaccgct tngca ggca a acagatgatc 

ggatttaaaa dcLtyectct tgaggaccaa 

ctatcatc«it ttgccttgag ctggagatcg 

fcttgcaccag acct.agt.ctt taatgaataa 
ctgcagagtt aUjegfctasa 



caagtcgtga agtgggcaaa ggtacttcca 2580 
attaccctaa tccagtattc ttggatgtgt 2640 
tacaaacata cgaacagcca atttctctat 2700 
gtacctatta sttagccttc ataaaataaa 2760 

2780 



<210> 18
<211> 3239
<212> DNA

<213> Homo sapiens
<400> 18

ygcaacacgc ttttaccagg aactcgctat. 
c eg c t c 1 1 ri g c c. a ga a t g a t. a go g g g <j ca. t 
agccaLggct Lettcacayc ataatcatag 
gtaagagttt etbgggaaaa gatettccaa 
tcttcctttt ctctcacaga gggcacaatc 
tcagtggtta aaactgLatt aaggccaaat 
gtattcaatt cccccttaac taaagttgtc 
tcaattggag gatcaaagtt gagaaasgta 
qecaccaaga cgatgtcata gaagtctgaa 
tcatacatct btgttggatc tcctgtgtac 
actgagecag atataagatt gcttttggat 
ttgccacctt ctactgccca aaggccagaa 
aaggcattqa tqtccqtqct ttggccataa 
ttgaggaact tctcagaaaa geeggoottt 
attccaagga agtcatctcc tcctagagca 
tagtcatgag actggtagcg gt a gate etc 
atgtgcatac ggagggattg aaatccatag 
aaccagttge Ictcctcaaa taccagagtc 
agaggctgat get caeca tg gggcgcetgc 
gggcagtggc gagtgccgcg aagctgggcg 

gcgtcaataa qaaqctegge ctcctgggtg 
tcgcagctcc caccaaggcc ctggaaaatc 
tgaaggccaa gaacttcaag aagagatgee 
aeggggatga aqactqcctg tacctcaaca 
cccgggarct gcocgttn tg atctggatct 
atggggccaa cttcctcaac aactacctgt 
aegtcategt ggtcaccttc aactaecqtg 
acgccaatct gecaggtaac tatggectte 
agaggaata'L eqcqqocttc gygggygacc 
ctggaggtgc cagcgtctct ctgcagaccc 
gagec6tcaq eeaqegcgqc gtqgecctga 
tctgggccaa aaaggtggct gagaaggtgg 
cccagtgtct gaaggttact gatccccgag 
caggectqqa gtaccccatg ctgcactatg 
tcatccccgc tgacccgatc aacctgtacg 
gcaccaaeaa eatqqacqqc cacatcttcg 
gcaacaagaa agtcaeggag gaggacttct 
aggggctcag aggegecaag acgacctttg 
catcccaqga gaataagaag aagactgt.gg 
tqcccaccga gattgeccta gcccagcaca 
cctacctgtt ttcccatccc teteggatge 
atgcagatga eai:teaqLac qttuteggga 
cccasqacsq gacagtctct aaggccatga 
gggaccccno ca tgggegae tcggctgtgc 
acagcqqeta cctqqagatc accaagaaga 
qaaccaactr cctgcgctac tggaccotca 
aggaggecac CCCtgtgeco GCCacaqqgg 
gtgacl ccrjci gy ccqccccc qtqccqccca 



aaccaegaat accgcttgtg agecagtaag 60 
ttctccgggg gcttatagtg agyatatgea 120 
gacagaaaga gctttaaaat ttgtgcttta 180 
acatatgttc catctyttga tggctcagga 240
ccaetactgt taatgaacaa atctgaatta 300 
ttatctatgg gtctagaget aaagatagat 360 
actatatgtt gataatattg atggaattcc 420 
atattcgaca tttttcgatt eaacggagtg 480
cgagtctcag ttccaatttg gtagaccact 540 
ttggtctttg ttttctcctc gatgtacatt 600 
gectgeagaa gccctgagca aacaagttta 660 
teagaacagg acagtgacac cgcccccaca 720 
ttgaccctca taacaggagc aatcatttca 780 
tqcaaggttt caagasgtgt tcgattaagc 840
tgaagtaatt tttcgacact actgaaggca 900 
atgaacttgt ctaacacgtc ctctacccac 960 
cgccaaacta atttaatcac gttaattatg 1020 
tctccattat atatccccag taggccaccc 1080
aactggttgt gttgggcctc acctgctgct 1140 
ccgtgtacac agaaggtggg ttcgtggaag 1200 
actctgtgga catcttcaag ggcatcccct 1260 
ctcagccaca tcctggctgg caagggaccc 1320 
tgcaggccac catcacccag gaeagcacct 1380 
tttgggtgcc ccagggcagg aagcaagtct 1440
atggaggcgc cttcctcatg gggtoeggee 1500 
atgaeggega ggagatcgee acacgeggaa 1560 
tcggccccct tgggttcctc agcactgggg 1620 
gggatcagca catggecatt gcttgggtga 1680 
ccaacaacat cacgctctte ggggagtctg 1740
tctcccccta caacaagggc ctcatccggc 1800
gtccctgggt catccagaaa aacccactct 1860
gttgccctgt gggtgatgcc gecaggatgg 1920 
ccctgacgct ggcctataag gtgccgctgg 1980
tgggcttcgt ccctgtcatt gatggagact 2040 
ccaacgccgc cgacatcgac tatatagcag 2100 
ccagcatcga catgcetgcc atcaacaagg 2160 
acaagctggt cagtgagttc acaatcacca 2220 
atgtctacac cgagtcctgg gcccaggacc 2280 
tggactttga gaccgatgtc ctcttcetgg 2340 
gagecaatge caagagtgee aagaeetacg 2400 
ccgtetacec caaatgggtg ggggcegacc 2460
agcccttcgc cacccccacg ggctaccggc 2520 
tcgcctactg gaccaacttt gecaaaacag 2580
ecacacactg ggaaccctac aotacggaaa 2640 
tgggcagcag ctccatgaag eggagectga 2700 
cetatctqqc gctgcccaca gtgaccgacc 2760
actccgaggc cactcccgtg ccccccacgg 2820 
egggtgacto cggggccccc cccgtgccgc 2880






ccacyggtga ctccggggcc cc ccccgtgc 
tgccgcccac gggtgarrtcc ggggcccccc 
cccccgtgcc gcc.cccqtqB cccccacggg 
gggtgact^c: ggggcccccc ctgfcgccccc 
agtcattagq ttttagcgtc ecatgagcct 
c:agggqct cc nctcccatct tgagctcttc 



cgcccacggg tgactccggg gccccccccg 2940 
ccgtgccgcc cacgggtgac tccggggccc 3000 
tgactccgag accgcccccg tgccgcccac 3060 
cactgactcc aaggaagctc agatgcctgc 3120 
tggtatcaag aggccacaag agtgggaccc 3180 
ctgaataaag cctcataccc ctgaaaaaa 3239 



<210> 19
<211> 917
<212> DNA

<213> Homo sapiens
<400> 19

gycacgagra gecccggagn ccgygccagg 
ccctccf.gc:C gcagccaccg agccgccgfcc 
ctgctggr.gc gectgcttct ctgcgtcctg 
cttcatcaag ttccitcgad ctgtgactgt 
tacttctcca acattcactg gtgcaactgc 
aLdgataagt caaaaacctg ctatgagggg 
actgacacca tgggccggcc ctgcctgccc 
taccatgcoc acaqatctga tgctcttcag 
gaggtggggg cacaaggarrc aaaagccctc 
ct^cttctccc agagggctgg ccataycaca 
ctgagggg^g cjagqcaggga aggccctctg 
ttgccaggaa cccagacaac cggaggcgac 
cgcttgtcca rigagtgcatg gtgcatgact 
gatgacagtg gggtggaagg ggacaaactt 
tgaggagqtg gggggLgccc gagagggatg 
ctctgtttgt CClocag 



gtccacctgt ccccgcageg ccggctcgcg 60 
tagcgccccrg acctcgccac catgagagcc 120 
gtcgtgagcg actccaaagg cagcaatgaa 180 
ctaaatggag gaacatgtgt gtccaacaag 240 
ccaaagaaat tcggagggca gcactgtgaa 300 
aatggtcact tttaccgagg aaaggccagc 360 
tggaactctg ccactgtcct tcagcaaacg 420 
ctgggcctgg ggaaacataa ttactgcagg 480 
cctacagttc ccagaaacct tgttaccatc 540 
agagaagtgc agcctctggt tgagtcttcc 600 
ggttggaatg acatccccfca tctttctgtg 660
cctggtgcta tgtgcaggtg ggcctaaagc 720 
gcgcagatgg tyagcatcac ttgacctgct 780 
acatgtcccc ttattccatc acagyaggac 840 
ctttctccta cctgcctccc taagacatcc 900 

917



<210> 20 
<211> 1819

<212> DNA 

<213> Homo sapiens 
<400> 20 

aaggtacatg ttctgct ect attaactatt 
acttagggct cttgccacgc ttgttagtat 
ttgagctgtt ttgctggaat gagagcttgt 
aatcLtccac agggtlragtt aattcaagac 
catagcaaat gaqactcaaa cttcctcccc 
agactaaaac acagaqggtt gaagaaagcc 
tttcctctot ettttccctt gtcttcctta 
a ate tec dag cctatcaagt ttgagetttc 
taaattctgt yqagtatgta COqacggccg 
gccggtggag ttcnagtgcc ctgaeggega 
gacctgtgcc tgecattaca actgtcccgg 
caggaagatg tacggagaca tggcatgaag 
ct.ggaacttg * act gat tea catcrtcattt 
tatttaaatc tgtttttcta actgggggaa 
catgtcaaac aaatagLcta tcaaccccag 
agtggaacta cattagtaca cagcaccaga 
tggqagggto CCagcagaaa ggttagtatc 
otgctatTitg aaqtgtaatt gagasggaaa 
ccccagtgac agctaggatg tgcattctcc 
ttaagtcaga ocagcagact cagcLctgac 
cggaatccLy tcgattagac tggacagctt 
gattttttaa aatttatatt gtaaatattg 
tatgtacagt. latctaagtt aatttaaagt 



tttcacagga aaaacagtgg ataggaccca 60 
aagcccgtta tctccaaaac tatctaacca 120 
gtaatagcaa ccaccagttt tccactacga 180
attccaagag aggctctggc tatttttgga 240
tcaaaatata aacagaagtc agacaacaga 300 
actcctcttg tagagtcget gatttttttt 360 
gaagggcaaa aagtgcatcc gtactcccaa 420 
tggctgcacc agcatgaaga cataccgagc 480 
atgctgcacc ccccacagaa ccaccaccct 540 
ggtcatgaag aagaacatga tgttcatcaa 600
agacaatgac atctttgaat cgctgtacta 660 
ccagagagtg agagacatta actcattaga 720 
ttcogtaaaa atgatttcag tagcacaagt 780
aagattccca cccaattcaa aacattgtgc 840 
acactggttt gaagaatgtt aagacttgae 900 
atgtatatta aggtgtggot ttaggagcag 960
atcagatagc atcttatacg agtaatatgc 1020 
attttagegt gctcactgac ctgcctgtag 1080 
agecatcaag agactgagtc aagttgttcc 1140 
attctgattc gaatgacact gttcaggaat 1200 
gtggcaagtg aatttgcctg taacaageca 1260 
tgtgtgtgtg tgtgtgtgta tatatatata 1320 
tgtttgtgcc ttttt^tttt tgtttttaat 1380 
gctttgatat ttcsolqtta qcctcaattt
tctgatcgtt caaagcatga aatggatact 
agtccgtcaa aacagattgt ttgcaaaggg 
c 1 3 gg t agg a a 3 L q t ggt ag c c t c 1 1 tt 
gtyactctat atagclqatc agttttttca 
actgt fctttc ggacaqttld tttgttgaga 
tctagttgaa aabaaagtgt atattttttc 
agtagtagta ggcgyeagc 



ctgaacacca taggtagaat gtaaagcttg 1440
tatatggaaa ttctgctcag atagaatgac 1500 
geggcatcag tgtccttggc aggctgattt 1560
aatgaacaaa tggcctttat taaaaactga 1620 
cctggaagca tttgtttcta ctttgatatg 1680 
gtgtgeccaa aagttaoatg tttgcacctt 1740 
t?taaaaaat gfccgacgcgg ccgcgaattt 1800

1819 



<210> 21 
<211> 2294 
<212> DNA

<213> Homo sapiens
<400> 21

tcscaacaac tcttccccgc tgayaggaga cagcgagtgc gactccaccc tccagctcga 60 

c.ggcagccgc cccggccgac agccccgaga cgacagcccg gcgcgtcccg gtccccacct 120 

ccgaccaccg ccagcgctcc aggccccgec gctccccgct cgccgccacc gcgccctccg 180 

ctccgcccqc aytgocaacc atgaccgccg ccagtatggg ccccgtccgc gtcgccttcg 240 

tggtcctcr.t cgccctctgc agccggccgg ccgtcggcca gaactgcagc gggccgtgcc 300 

ggtgcccgga cgagccggeg ccgcgctgcc cggcgggcgt gagcctcgtg ctggacggct 360 

gcggctgctg ccgcytctge gccoagcagc tgggcgagct gtgcaccgag cgcgacccct 420 

gcgacccgca caagggcctc ttctgtgact tcggctcccc ggccaaccgc aagatcggcg 480

tgtgcaccgc caaagatggt gctccctgca tctteggtgg tacggtgt&c cgcagcggag 540 

ogtccttcca gagcagctyc aagtaccagt gcacgtgcct ggacggggcg gtgggctgca 600 

tgcccctgtg cages tggac gttcgtctgc ccagccctga ctgccccttc ccgagcttac 660 

cgactggaag acacgtttgg cccagaccea actatgatta gagccaactg cctggtccag 720 

accacagagt ggagcgcctg ttccaagacc tgtgggatgg gcatctccac ccgggttacc 780 

riatgacaacg r:ctcctgcag gciagageag cagagccgcc tgtgcatggt caggccttgc 840

gaagtgacct gqaagagaao atta^gaagg gcaaaaagtg catccgtact cccaaaatct 900 

ccaagcc1.at cadgtttgag ctttctgget gcaccagcat gaagacatac cgagctaaat 960 

tctgtggagt atgt/iecgac gqccgatgct gcacccccca cagaaccacc accetgccgg 1020 

tggagf.tcaa gtgccctgac ggcgaggtca tgaagaegaa catgatgttc atcaagacct 1080 

gtgcctgccn ttacaactgt cccggagaca atgacatctt tgaatcgctg tactacagga 1140 

agatgtocgg agacatggca tgaagcoaga gagtgagaga cattaactca ttagactgga 1200 

acttgaactg attcacatct catttttccg taaaaatgat ttcagtagca caagttattt 1260 

aaatctgttt ttctaactgg gggaaaagat tcccacccaa ttcaaaacat tgtgccatgt 1320 

caaacaaata gtctaf.caac cccagacact ggtttgaaga atgttaagac ttgacagtgg 1380 

aactacatta gtacaeagca ocagaatgta tattaaggtg tggctttagg agcagtggga 1440

gggtaccagc agaaaggtta gtatcatcag atagcatctt atacgagtaa tatgcctgct 1500 

atttgaagtg taattgagaa ggaaaatttt agcgtgctca ctgacctgcc tgtagcccca 1560 

gtgacagcta ggatgtgcat tctccagcca tca&gagact gagtcaagtt gttccttaag 1620 

tcagaacagc dgactcaqct ctgacattct gattcgaatg acactgttca ggaatcggaa 1680 

tcctgtcgat tagactggac agcttgtggc aagtgaattt gcctgtaaca agccagattt 1740 

tttaaaattt atettgtaaa tattgtgtgt gtgtgtgtgt gtgtatatat atatatatgt 1800 

ac^gttatct aagttaattt aaagttgttt gtgccttttt atttttgttt ttaatgcttt 1860

gotatttcaa tgttagcctc aatttctgaa caccataggt agaatgtaaa gcttgtctga 1920 

tcgttcaaag catgaaatgg atacttatat ggaaattctg ctcagataga afcgacagtcc 1980 

gtcaaaacag af.i'gtttgca aaggggaggc atcagtgtcc ttggcaggct gatttctagg 2040 

taggaaatgt- gqtagcctca cttttaatga acaaatggcc tttattaaaa actgagtgac 2100 

tctatatacfc tgatcagttt tttcacctgg aagcatttgt ttctactttg atatgactgt 2160 

ttttcggaca gtttritttgt fcgagagtgtg accaaaagth acatgtttgc acctttctag 2220

ttgaaaatan yqtgtatatt ttttctataa aaaatgtcga cgcggccgcg aatttagtag 2280
tagtaggcgg czaqc 2294



<210> 22
<211> 594
<212> DNA

<213> Homo sapiens
<400> 22

tg^aatatgg agaatagtct tagatgtgtt 
ggagcttcct tgctcagcgc gcatcttcaa 
gcactgcgct tcctctcaga accttctgat 
ctcgactgct Cugcggagtc cgaccgagga 
ttcatctggc cttcjggraatg gatgoaagga 
tacaaaacat autt.cattnc agaeaccaca 
catctttagg agattctggc tcaattatta 
etacttagtc toqtctcatc ttggoagact 
ggacaafctgg ycagaatcac cactccttca 
tgaaatctqa aaqatgtyat tttgtcaggt 



cgggtaccca agctggcttt tgtactcttc 60 

gtaaccggtt ttcaaattaa agctttcaca 120 

gccgtcacaa tgcggggagg aaatgtcctc 180 

gttccagtga tcaagtggaa gaaagatgca 240 

agcagcaact ttcaaatggg tctctgctga 300 

agccagatga gggactttac caatgtgagg 360 

gtcggacagc aaaagttgca gtagcaggtc 420

gatcagaata actcattttc atcaataaca 480 

tgcactctcc tagtgagggt ctgagaaagt 540 

ggtgctataa agaattacat ctca 594 



<210> 23 

<211> 881

<212> DNA

<213> Homo sapiens 
<400> 23 

cttatacnag cgt tat gaga tcaagnngac 
ggatgccgct qac^tccggt tcgtetacac 
ccacaggtcc cacaaccgca gcagagg&gt 
tcttgcacat eactaectgc agtttcgtgg 
gccggggctt caccaagaoc tacectgttg 
acactggcgg rregttactag tggatccgag 
tattctatag tgtcacctaa atagcttggc 
aaattgttat r.cqctcacaa ttceacacaa 
ctggggtgcc taatgagtga gctaactcac 
ceagtcggga aacctgtcgt gccagctgca 
cggtttgcgt attgggcgct cttccgcttc 
tcngctgcgcf cgagcggtat cagctcactc 
ggggataacg caqaaagaac atgtgagcaa 
gcgngttntg gcgttttcct aagctccgc.c 
tanagggggg aacccgnuag acttaaaent 



caagatgtat aaagggttcc aagccttang 60 
ccccgccatg gsgagtgtct gcggatactt 120 
ttctcattgc tggaaaactg caggatggac 180
ctccctggaa cagcctgagc ttagctcagc 240 
gctgtgagga atgcaagggc gaattccagc 300 
ctcggtaccs agcttgatgc atagcttgag 360 
gtaatcatgg tcatagctgt ttcctgtgtg 420 
catacgagcc ggaagcataa agtgtaaagc 480
attaattgcg ttgcgctcac tgcccgcttt 540 
ttaatgaatc ggccaacgcg cggggagagg 600 
ctcgctcact gactcgctgc gctcggtcgt 660 
aaangcggts atacggtttn cacagaatca 720 
aaggcagcaa aariggcagga aecgnaaaag 780
ccctgacDag catncaaant tnacgctcag 840 
ccagcgttcc c 881 



<210> 24 
<211> 893
<212> DNA 

<213> Homo sapiens 
<400> 24 

cttatacnaq cgttatgaga tcaagnngac 

agcattttct eacatcLtcc tcctggtcaa 

agaaagttgg ct.gtgatctt gggatgctat 

gattatgtcg gtaaaagaga cacttcccct 

agccttaggq gatgccgctg acatccggtt 

cggatscttc cacaggfcccc acaaccgcag 

gtccccgcgc cctgtgccac aceaaccagt 

ggcgaggctc tgatgggaat ggtcccactg 

cagaaggccg ggcctttggc agcttqgcrag 

CCyccccgcc t.t.t ctcct;ta ggaaaact.gc 

gttttgtggc tccctggaac: agcctgagct 

acacrtgttgg ctqtgsggaa. tgcaagggcg 

gatccnagct cggnccaagc ttgatgcata 

gcttggcgta tcatgggcat agctgttcct 

cacatncnan ccggarignta aayngnaagc 



caaggtatcc octgcccccg cctctttccc 60 
aacagaaact tctggccact ggttggtgcg 120 
atgacttcag ggaagaacct ggagtgggag 180
catccatcaa cagatgtata aagggttcca 240
cgtctacacc cccgccatgg agagtgtctg 300 
cgaggagttt ctcattgctg gtgaggcacc 360 
ccctggggcg cggcctagca accacgaggg 420 
gaaatgggga ccaccccaai: ttcagtctat 480 
ctcagccaat cacaagctgc ttgtcggtcc 540 
aggatggact cttgcacatc actacctgca 600 
tagctcagcg ccggggcttc accaagacct 660 
yattccagca cactggcggc cgttctagtg 720 
gcttgagtat tctatagtgn cacctaaata 780 
gnggaaatng ntatccgtna caattccnca 840
cbqggtgcna tgagngacta ctn 893
<210> 25
<211> 887
<212> DNA
<213> Homo sapiens

<400> 25

tcgaccgccc cttgqcttr.t geactgatgg tgggtggatq agtaatgcat ccaggaagcc 60 
tggaggcctg tggtttccgc acccgctgcc acccccgccc ctagcgtgga catttatcct 120 
ctagcgctca qgccctqccg ccatcgccgc agatccagcg cecagagaga caccagagaa 180 
cccaccatgg ccc^ctttg-i gcccctggct tctggcatcc tgttgttgct gtggctgata 240
gcccccaqca gggcctgcac ctgtgtccca ccccacccac agacggcctt ctgcaattcc 300 
gacctcgtca t cngggccaa gttcgtgggg acaccagaag tcaaccagac caccttatac 360 
cagcgttatq aqatcaagat gaocaaqatg tataaagggt tccaagcctt aggggatgoc 420
qctgacatcc ggttcqtcta cacccccgcc atggagagtg tctgcggata cttccacagg 480 
abggactctt gcacatcact acctgcagtt tcgtggctcc ctggaacagc ctgagcttag 540 
ctcagcgccg gggcttcacc aagacctaca ctgttggctg tgaggaatgc acagtgtttc 600 
cctgtttatc cntcccetgc aaactqcaga gtggcactca ttgcttgtgg acggaccagc 660
tcctccaagg ctc.tgaaaag ggcttccagt cccgtcacct tgcctgcctg cctcgggagc 720 
cagggctgtq cacctqgcag tecctgcggt cccagatagc ctgaatcctg cccggagtgg 780 
aagctgaagc ctgcac.agt.g tccaecctgt tcccactccc atctttcttc cggacaatga 840 
aataaaqagt fcaccacccca acccaataaa actgatgtta ctagcca 887 



<210> 26

<211> 814

<212> DNA

<213> Homo sapiens 

<400> 26 

tcgaccgccc cttggcttct gcactqatgg 
tggaggcctg tggtttccgc dcccgctgcc 
ctagcgctca ggccctgccg ccatcgccgc 
tgcaattccg acctcgtcat cagggccaag 
accttatace agcgtf.atga gatcaagatg 
ggggatgccg ctgaeatccg gttcgtctac 
ttccacaggt cceacnaccg cagcgaggag 
ctcf.tgcaca t cactacctq cagtt.tcgtg 
cgccggggct teaccaagac ctocactgtt 
ttatccatcc cctgcaaact gcagagtggc 
caaggctctg aaaagggctt ccagtcccgt 
ctgtgcacct ggcagtccct gcggtcccag 
qaagcctqca cagtqtccac cctgttccca 
agagttacca cccagcag^a aaaacaaaca 



tgggtggatg agtaatgcat ccaggeagcc 60 

acccccgccc ctagcgtgga catttatcct 120 

agatccagcg cccagagaga cacc&gattc 180

ttcgtgggga caccagaagt caaccagacc 240

accaagatgt ataaagggtt ccaagcctta 300 

acccccgcca tggagagtgt ctgcggatac 360 

tttctcattg ctggaaaact gcaggatgga 420 

gctccctgga acagcctgag cttagctcag 480 

ggctgtgagg aatgcacagt gtttccctgt 540 

actcattgct tgtggacgga ccagctcctc 600 

caccttgcct gcctgcctcg ggagccaggg 660 

atagcctgaa tcctgcccgg sgtggaagct 720 

ctcccatctt tcttccggac aatgaaataa 780
agtc 814 



<210> 27 
<211> 491 
<212> PRT

<213> Homo sapiens
<400> 27

Met Arg Glu Asn Met Ala Arg Gly Pro Cys Asn Ala Pro Arg Trp Val 
1 5 10 15 

Ser Leu Met Val Leu Val Ala Ile Gly Thr Ala Val Thr Ala Ala Val
20 25 30

Asn Pro Gly Val Val Val Arg Ile Ser Gln Lys Gly Leu Asp Tyr Ala
35 40 45

Ser Gln Gln Gly Thr Ala Ala Leu Gln Lys Glu Leu Lys Arg Ile Lys
50 55 60



Ile Pro Asp Tyr Ser Asp Ser Phe Lys Lys His Leu Gly Lys Gly
65 70 75 80

His Tyr Ser Phe Tyr Ser Met Asp Ile Arg Glu Phe Gln Leu Pro Ser
85 90 95

Ser Gln Ile Ser Met Val Pro Asn Val Gly Leu Lys Phe Ser Ile Ser
100 105 110

Asn Ala Asn Ile Lys Ile Ser Gly Lys Trp Lys Ala Gln Lys Arg Phe
115 120 125

Leu Lys Met Ser Gly Asn Phe Asp Leu Ser Ile Glu Gly Met Ser Ile
130 135 140 

Ser Ala Asp Leu Lys Leu Gly Ser Asn Pro Thr Ser Gly Lys Pro Thr 
145 150 155 160 

Ile Thr Cys Ser Ser Cys Ser Ser His Ile Asn Ser Val His Val His
165 170 175

Ile Ser Lys Ser Lys Val Gly Trp Leu Ile Gln Leu Phe His Lys Lys
180 185 190

Ile Glu Ser Ala Leu Arg Asn Lys Met Asn Ser Gln Val Cys Glu Lys
195 200 205

Val Thr Asn Ser Val Ser Ser Lys Leu Gln Pro Tyr Phe Gln Thr Leu
210 215 220

Pro Val Met Thr Lys Ile Asp Ser Val Ala Gly Ile Asn Tyr Gly Leu
225 230 235 240

Val Ala Pro Pro Ala Thr Thr Ala Glu Thr Leu Asp Val Gln Met Lys
245 250 255

Gly Glu Phe Tyr Ser Glu Asn His His Asn Pro Pro Pro Phe Ala Pro 
260 265 270 

Pro Val Met Glu Phe Pro Ala Ala His Asp Arg Met Val Tyr Leu Gly 
275 280 285

Leu Ser Asp Tyr Phe Phe Asn Thr Ala Gly Leu Val Tyr Gln Glu Ala
290 295 300

Gly Val Leu Lys Met Thr Leu Arg Asp Asp Met Ile Pro Lys Glu Ser
305 310 315 320

Lys Phe Arg Leu Thr Thr Lys Phe Phe Gly Thr Phe Leu Pro Glu Val
325 330 335

Ala Lys Lys Phe Pro Asn Met Lys Ile Gln Ile His Val Ser Ala Ser
340 345 350 



Thr Pro Pro His Leu Ser Val Gln Pro Thr Gly Leu Thr Phe Tyr Pro
355 360 365 



Ala Val Asp Val Gln Ala Phe Ala Val Leu Pro Asn Ser Ser Leu Ala
370 375 380 
Ser Leu Phe Leu Ile Gly Met Gly Lys Gln Phe Leu Gly Trp Thr Asp
385 390 395 400

Glu Glu Pro Gln Thr Val Pro Thr Ala Leu Ser Leu Glu Ser Gly Asp
405 410 415

His Val Asn Pro Val Trp Ile Gln Thr Trp Thr Val Ser Leu Arg Ser
420 425 430

Leu Arg Leu Glu Ser Leu Tyr Ser Met Val Pro Thr Pro Gly Gly Ile
435 440 445

His Ser Pro Ser His Ser Leu Val Arg Leu Phe Thr Tyr Ser Phe Asn
450 455 460

Tyr Ser Phe Ser Gln Phe Leu Ile His Ser Xaa Ile His Ser Met Leu
465 470 475 480

Ala



<210> 28 
<211> 628
<212> PRT

<213> Homo sapiens
<400> 28

Met Gly Arg Leu Gln Leu Val Val Leu Gly Leu Thr Cys Cys Trp Ala
1 5 10 15

Val Ala Ser Ala Ala Lys Leu Gly Ala Val Tyr Thr Glu Gly Gly Phe 
20 25 30 

Val Glu Gly Val Asn Lys Lys Leu Gly Leu Leu Gly Asp Ser Val Asp
35 40 45

Ile Phe Lys Gly Ile Pro Phe Ala Ala Pro Thr Lys Ala Leu Glu Asn
50 55 60

Pro Gln Pro His Pro Gly Trp Gln Gly Thr Leu Lys Ala Lys Asn Phe
65 70 75 80

Lys Lys Arg Cys Leu Gln Ala Thr Ile Thr Gln Asp Ser Thr Tyr Gly
85 90 95

Asp Glu Asp Cys Leu Tyr Leu Asn Ile Trp Val Pro Gln Gly Arg Lys
100 105 110

Gln Val Ser Arg Asp Leu Pro Val Met Ile Trp Ile Tyr Gly Gly Ala
115 120 125 

Phe Leu Met Gly Ser Gly His Gly Ala Asn Phe Leu Asn Asn Tyr Leu
130 135 140

Tyr Asp Gly Glu Glu Ile Ala Thr Arg Gly Asn Val Ile Val Val Thr
145 150 155 160

Phe Asn Tyr Arg Val Gly Pro Leu Gly Phe Leu Ser Thr Gly Asp Ala
165 170 175 






Asn Leu Pro Gly Asn Tyr Gly Leu Arg Asp Gln His Met Ala Ile Ala
180 185 190

Trp Val Lys Arg Asn Ile Ala Ala Phe Gly Gly Asp Pro Asn Asn Ile
195 200 205

Thr Leu Phe Gly Glu Ser Ala Gly Gly Ala Ser Val Ser Leu Gln Thr
210 215 220

Leu Ser Pro Tyr Asn Lys Gly Leu Ile Arg Arg Ala Ile Ser Gln Ser
225 230 235 240

Gly Val Ala Leu Ser Pro Trp Val Ile Gln Lys Asn Pro Leu Phe Trp
245 250 255

Ala Lys Lys Val Ala Glu Lys Val Gly Cys Pro Val Gly Asp Ala Ala
260 265 270

Arg Met Ala Gln Cys Leu Lys Val Thr Asp Pro Arg Ala Leu Thr Leu
275 280 285

Ala Tyr Lys Val Pro Leu Ala Gly Leu Glu Tyr Pro Met Leu His Tyr 
290 295 300 

Val Gly Phe Val Pro Val Ile Asp Gly Asp Phe Ile Pro Ala Asp Pro
305 310 315 320

Ile Asn Leu Tyr Ala Asn Ala Ala Asp Ile Asp Tyr Ile Ala Gly Thr
325 330 335

Asn Asn Met Asp Gly His Ile Phe Ala Ser Ile Asp Met Pro Ala Ile
340 345 350

Asn Lys Gly Asn Lys Lys Val Thr Glu Glu Asp Phe Tyr Lys Leu Val
355 360 365

Ser Glu Phe Thr Ile Thr Lys Gly Leu Arg Gly Ala Lys Thr Thr Phe
370 375 380

Asp Val Tyr Thr Glu Ser Trp Ala Gln Asp Pro Ser Gln Glu Asn Lys
385 390 395 400

Lys Lys Thr Val Val Asp Phe Glu Thr Asp Val Leu Phe Leu Val Pro
405 410 415

Thr Glu Ile Ala Leu Ala Gln His Arg Ala Asn Ala Lys Ser Ala Lys
420 425 430

Thr Tyr Ala Tyr Leu Phe Ser His Pro Ser Arg Met Pro Val Tyr Pro 
435 440 445

Lys Trp Val Gly Ala Asp His Ala Asp Asp Ile Gln Tyr Val Phe Gly
450 455 460

Lys Pro Phe Ala Thr Pro Thr Gly Tyr Arg Pro Gln Asp Arg Thr Val
465 470 475 480

Ser Lys Ala Met Ile Ala Tyr Trp Thr Asn Phe Ala Lys Thr Gly Asp
485 490 495

Pro Asn Met Gly Asp Ser Ala Val Pro Thr His Trp Glu Pro Tyr Thr
500 505 510

Thr Glu Asn Ser Gly Tyr Leu Glu Ile Thr Lys Lys Met Gly Ser Ser
515 520 525

Ser Met Lys Arg Ser Leu Arg Thr Asn Phe Leu Arg Tyr Trp Thr Leu
530 535 540

Thr Tyr Leu Ala Leu Pro Thr Val Thr Asp Gln Glu Ala Thr Pro Val
545 550 555 560

Pro Pro Thr Gly Asp Ser Glu Ala Thr Pro Val Pro Pro Thr Gly Asp
565 570 575

Ser Glu Thr Ala Pro Val Pro Pro Thr Gly Asp Ser Gly Ala Pro Pro
580 585 590

Val Pro Pro Thr Gly Asp Ser Gly Ala Pro Pro Val Pro Pro Thr Gly
595 600 605

Asp Ser Gly Ala Pro Pro Val Pro Pro Thr Gly Asp Ser Glu Ala Arg
610 615 620

Ala His Leu Gly
625



<210> 29 
<211> 641 
<212> PRT 

<213> Homo sapiens
<400> 29

Leu Leu Leu Leu Gly Phe Leu Leu Val Ser Leu Glu Ser Thr Leu Ser
1 5 10 15

Ile Pro Pro Trp Glu Ala Pro Lys Glu His Lys Tyr Lys Ala Glu Glu
20 25 30 

His Thr Val Val Leu Thr Val Thr Gly Glu Pro Cys His Phe Pro Phe 
35 40 45 

Gln Tyr His Arg Gln Leu Tyr His Lys Cys Thr His Lys Gly Arg Pro
50 55 60

Gly Pro Gln Pro Trp Cys Ala Thr Thr Pro Asn Phe Asp Gln Asp Gln
65 70 75 80

Arg Trp Gly Tyr Cys Leu Glu Pro Lys Lys Val Lys Asp His Cys Ser
85 90 95

Lys His Ser Pro Cys Gln Lys Gly Gly Thr Cys Val Asn Met Pro Ser
100 105 110

Gly Pro His Cys Leu Cys Pro Gln His Leu Thr Gly Asn His Cys Gln
115 120 125

Lys Glu Lys Cys Phe Glu Pro Gln Leu Leu Arg Phe Phe His Lys Asn
130 135 140

Gln Ile Trp Tyr Arg Thr Glu Gln Ala Ala Val Ala Arg Cys Gln Cys
145 150 155 160

Lys Gly Pro Asp Ala His Cys Gln Arg Leu Ala Ser Gln Ala Cys Arg
165 170 175 

Thr Asn Pro Cys Leu His Gly Gly Arg Cys Leu Glu Val Glu Gly His 
180 185 190

Arg Leu Cys His Cys Pro Val Gly Tyr Thr Gly Pro Phe Cys Asp Val
195 200 205

Asp Thr Lys Ala Ser Cys Tyr Asp Gly Arg Gly Leu Ser Tyr Arg Gly
210 215 220

Leu Ala Arg Thr Thr Leu Ser Gly Ala Pro Cys Gln Pro Trp Ala Ser
225 230 235 240

Glu Ala Thr Tyr Arg Asn Val Thr Ala Glu Gln Ala Arg Asn Trp Gly
245 250 255

Leu Gly Gly His Ala Phe Cys Arg Asn Pro Asp Asn Asp Ile Arg Pro
260 265 270

Trp Cys Phe Val Leu Asn Arg Asp Arg Leu Ser Trp Glu Tyr Cys Asp
275 280 285

Leu Ala Gln Cys Gln Thr Pro Thr Gln Ala Ala Pro Pro Thr Pro Val
290 295 300

Ser Pro Arg Leu His Val Pro Leu Met Pro Ala Gln Pro Ala Pro Pro
305 310 315 320

Lys Pro Gln Pro Thr Thr Arg Thr Pro Pro Gln Ser Gln Thr Pro Gly
325 330 335

Ala Leu Pro Ala Lys Arg Glu Gln Pro Pro Ser Leu Thr Arg Asn Gly
340 345 350

Pro Leu Ser Cys Gly Gln Arg Leu Arg Lys Ser Leu Ser Ser Met Thr
355 360 365

Arg Val Val Gly Gly Leu Val Ala Leu Arg Gly Ala His Pro Tyr Ile
370 375 380

Ala Ala Leu Tyr Trp Gly His Ser Phe Cys Ala Gly Ser Leu Ile Ala
385 390 395 400

Pro Cys Trp Val Leu Thr Ala Ala His Cys Leu Gln Asp Arg Pro Ala
405 410 415

Pro Glu Asp Leu Thr Val Leu Gly Gln Glu Arg Arg Asn His Ser
420 425 430

Cys Glu Pro Cys Gln Thr Leu Ala Val Arg Ser Tyr Arg Leu His Glu
435 440 445

Ala Phe Ser Pro Val Ser Tyr Gln His Asp Leu Ala Leu Leu Arg Leu
450 455 460

Gln Glu Asp Ala Asp Gly Ser Cys Ala Leu Leu Ser Pro Tyr Val Gln
465 470 475 480
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Pro Val Cys Leu Pro Ser Gly Ala Ala Arg Pro 3er Glu Thr Thr Leu 

4HU 4S?6 495 

Cys Gin Vol Ala Gly Trp Gly His Glu Phe Glu Gly Ala Glu Glu Tyr 

500 505 510 

Ala Scr Phe T.eiu Gin Glu Ala Gin Val Pro Phe Leu Ser Leu Glu Ary 

blh h20 525 

Cys Ser Ala Pro Asp Val His Gly Ser Ser lie Leu Pro Gly Met Leu 

530 535 ' 540 

Cys Ala Gly Phe Leo Glu Gly Gly Thr Asp Ala Cys Ala Gly Glu Leu 

S45 550 555 560 

Leu 'Ala Gly Trp Arg Pro Ser Pro Arg Pro Ser Ala Xaa Ser Gin Val 

565 570 575 

His Ser Ala Asp Cys Val Phe Pro Thr Gin Gly Asp Ser Gly Gly Pro 

5U0 585 590 

Leu Val Cys Glu Asp Gin Ala Ala Glu Arg Arg Leu Thr Leu Gin Gly 

595 600 605 

lie He Ser Trp Gly Ser Gly Cys Gly Asp Arg A3n Lys Pro Gly Vdl 

610 615 620 



Tyr Thr Asp Val Ala Tyr Tyr Leu Aid Trp lie Arg Glu his Thr Val 
625 «30 635 640 



<210> 30 
<211> 164
<212> PRT 

<213> Homo sapiens
<400> 30 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ser His Asn Arg Ser Glu Glu Phe Leu Ile Ala Gly Lys Leu
100 105 110

Gln Asp Gly Leu Leu His Ile Thr Thr Cys Ser Phe Val Ala Pro Trp
115 120 125

Asn Ser Leu Ser Leu Ala Gln Arg Arg Gly Phe Thr Lys Thr Tyr Thr
130 135 140

Val Gly Cys Glu Glu Cys Thr Val Phe Pro Cys Ser His Ser His Leu
145 150 155 160

Ser Ser Gly Gln



<210> 31

<211> 123 

<212> PRT 

<213> Homo sapiens

<400> 31 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ser His Asn Arg Ser Glu Glu Phe Leu Ile Ala Gly Lys Leu
100 105 110

Gln Val Val Met Cys Lys Ser Pro Ser Val Val
115 120



<210> 32
<211> 211 
<212> PRT 

<213> Homo sapiens
<400> 32 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ser His Asn Arg Ser Glu Glu Phe Leu Ile Leu Leu Gly Lys
100 105 110

Leu Gln Asp Gly Ile Phe Ala His Ser Leu Thr Cys Ser Phe Cys Trp
115 120 125

Val Pro Trp Glu Asn Ser Leu Ser Leu Ala Gln Arg Arg Gly Phe Thr
130 135 140

Lys Thr Tyr Thr Val Gly Cys Glu Glu Cys Thr Val Phe Pro Cys Leu
145 150 155 160

Ser Ile Pro Cys Lys Leu Gln Ser Gly Thr His Cys Leu Trp Thr Asp
165 170 175

Gln Leu Leu Gln Gly Ser Glu Lys Gly Phe Gln Ser Arg His Leu Ala
180 185 190

Cys Leu Pro Arg Glu Pro Gly Leu Cys Thr Trp Gln Ser Leu Arg Ser
195 200 205

Gln Ile Ala
210



<210> 33 
<211> 160 
<212> PRT 

<213> Homo sapiens 
<400> 33

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ser His Asn Arg Ser Glu Glu Phe Leu Ile Leu Ser Ile Pro
100 105 110

Cys Lys Leu Gln Ser Gly Thr His Cys Leu Trp Thr Asp Gln Leu Leu
115 120 125

Gln Gly Ser Glu Lys Gly Phe Gln Ser Arg His Leu Ala Cys Leu Pro
130 135 140

Arg Glu Pro Gly Leu Cys Thr Trp Gln Ser Leu Arg Ser Gln Ile Ala
145 150 155 160



<210> 34 
<211> 197
<212> PRT

<213> Homo sapiens 
<400> 34 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ala Gly Lys Leu Gln Asp Gly Leu Leu His Ile Thr Thr Cys
100 105 110

Ser Phe Val Ala Pro Trp Asn Ser Leu Ser Leu Ala Gln Arg Arg Gly
115 120 125

Phe Thr Lys Thr Tyr Thr Val Gly Cys Glu Glu Cys Thr Val Phe Pro
130 135 140

Cys Leu Ser Ile Pro Cys Lys Leu Gln Ser Gly Thr His Cys Leu Trp
145 150 155 160

Thr Asp Gln Leu Leu Gln Gly Ser Glu Lys Gly Phe Gln Ser Arg His
165 170 175

Leu Ala Cys Leu Pro Arg Glu Pro Gly Leu Cys Thr Trp Gln Ser Leu
180 185 190

Arg Ser Gln Ile Ala
195



<210> 35 
<211> 494 
<212> PRT
<213> Homo sapiens
<400> 35

Met Arg Ala Leu Leu Ala Arg Leu Leu Leu Cys Val Leu Val Val Ser
1 5 10 15

Asp Ser Lys Gly Ser Asa Glu Leo His Gin Val Pro Ser Asn Cys Asp 
20 25 30 

Cys r.eu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser Asn lie 
35 40 45 

His Tip Cys Asa Cys Pro Lys Lys Pne Gly Gly Gin His Cys Glu lie 
50 b5 60 

Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr Arg Gly 
65 70 75 30 

Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp Asn Ser 
35 90 95 

Ala Thr Val Leu C,Ln Gin Thr Tyr His Ala His Arg Ser Asp Ala Leu 
100 105 110 

Gin Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Glu Val Gly Ala Gin 
115 12 0 125 

Gly Pro Lys Ala Leu Pro Thr V*I Pro Arg Asn Leu Val Thr lie Pro 
130 135 140 

Phe Ser Gin Arg Ala Gly His Ser Thr Arg Glu VaJ. Gin Pro Leu Val 
145 " 150 155 160 

Glu Ser r>or Leu Arq Gly Gly Gly Arg Glu Gly Pro Leu Gly Trp Asn 
165 170 175 

Asp lie Pro Tyr Lau Scr Val Leu Pro Gly Asn Pro Asp Asn Arg Arg 
180 185 1^0 

Arg Pro Trp CyS Tyr Val Gin Val Gly Lou Lys Pro Leu Val Gin Glu 
19b 200 705 

Cys Met Val His Asp Cys Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro 
210 215 220 

Glu Glu Leu Lys Phe Gin CyS Gly Gin Lys Thr Leu Arg Pro Arg Phe 
225 230 235 24 0 

Lys lie lie Gly Gly Glu Pho Thr Thr lie Glu Asn Gin Pro Trp Phe 
245 250 255 

Ala Ala lie Tyr Arq Arq His Arq Gly Gly Ser Val Thr Tyr Val Cys 
2GQ 2 65 2*70 

Gly Gly Scr Lou lie Ser Pro Cys Trp Val lie Ser Ala Thr His Cys 
275 280 285 

Phe Ile Asp Tyr Pro Lys Lys Glu Asp Tyr Ile Val Tyr Leu Gly Arg
290 295 300 



Ser Arg Leu Asn Ser Asn Thr Gln Gly Glu Met Lys Phe Glu Val Glu
305 310 315 320

Asn Leu lie Leu His Lys Asp Tyr Ser Ala Asp Thr Leu Ala His His 
325 330 335 

Asn Asp lie Ala Ley Leu Lys lie Arg Ser Lys Glu Gly Arg Cys Ala 
340 345 350 

Cln Pro Ser Arg Thr lie Gin Thr lie Cys Leu Pro Ser Met Tyr Asn 
355 " 360 365 

Asp Fro Gli: Phe G.l y Thr Ser Cys Glu lie Thr Gly Phe Gly Lys Glu 
370 375 330 

Asn Ser Thr Asp Tyr Leu Tyr Pro GlU Gin Leu Lys Met Thr Val Val 
355 390 395 400 

Lys T.eu T.1 e Ser His Arg Glu Cys Gin Gin Pro His Tyr Tyr Gly Ser 
405 410 415 

Glu Val Thr Thr T,y.«? Met Leu Cys Ala Ala Asp Pro Gin Trp Lys Thr 
420 425 430 

Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys Ser Leu Gin 
435 " 440 <M5 

Gly Arq Met Thr Leu Thr Gly lie Val Ser Trp Gly Arg Gly Cys Ala 
4F>0 455 460 

Leu Lys Asp T.ys Pro Gly Val Tyr Thr Arg Val Ser His Phe Leu Pro 
465 470 475 480 

Trp lie Arq Sex His Thr Lys Glu Glu Asn Gly Leu Ala Leu 
485 490 



<210> 36
<211> 285
<212> PRT

<213> Homo sapiens
<400> 36 

Met Gln Met Ser Pro Ala Leu Thr Cys Leu Val Leu Gly Leu Ala Leu
1 5 10 15

Val Phe Gly Glu Gly Ser Ala Val His His Pro Pro Ser Tyr Val Ala
20 25 30

His Leu Ala Ser Asp Phe Gly Val Arg Val Phe Gln Gln Val Ala Gln
35 40 45

Ala Ser Lys Asp Arg Asn Val Val Phe Ser Pro Tyr Gly Val Ala Ser
50 55 60

Val Leu Ala Met Leu Gln Leu Thr Thr Gly Gly Glu Thr Gln Gln Gln
65 70 75 80

Ile Gln Ala Ala Met Gly Phe Lys Ile Asp Asp Lys Gly Met Ala Pro
85 90 95

Ala Leu Arg His Leu Tyr Lys Glu Leu Met Gly Pro Trp Asn Lys Asp
100 105 110

Glu Ile Ser Thr Thr Asp Ala Ile Phe Val Gln Arg Asp Leu Lys Leu
115 120 125

Val Gln Gly Phe Met Pro His Phe Phe Arg Leu Phe Arg Ser Thr Val
130 135 140

Lys Gln Val Asp Phe Ser Glu Val Glu Arg Ala Arg Phe Ile Ile Asn
145 150 155 160

Asp Trp Val Lys Thr His Thr Lys Gly Met Ile Ser Asn Leu Leu Gly
165 170 175

Lys Gly Ala Val Asp Gln Leu Thr Arg Leu Val Leu Val Asn Ala Leu
180 185 190

Tyr Phe Asn Gly Gln Trp Lys Thr Pro Phe Pro Asp Ser Ser Thr His
195 200 205

Arg Arg Leu Phe His Lys Ser Asp Gly Ser Thr Val Ser Val Pro Met
210 215 220

Met Ala Gln Thr Asn Lys Phe Asn Tyr Thr Glu Phe Thr Thr Pro Asp
225 230 235 240

Gly His Tyr Tyr Asp Ile Leu Glu Leu Pro Tyr His Gly Asp Thr Leu
245 250 255

Ser Met Phe Ile Ala Ala Asp Leu Val Pro Thr Glu Ala Leu Cys Arg
260 265 270

Met Glu Leu Arg Gly Leu Gln Glu Leu Leu Cys Ala Trp
275 280 285



<210> 37 
<211> 399 
<212> PRT 

<213> Homo sapiens 
<400> 37 

Met Gln Met Ser Pro Ala Leu Thr Cys Leu Val Leu Gly Leu Ala Leu
1 5 10 15

Val Phe Gly Glu Gly Ser Ala Val His His Pro Pro Ser Tyr Val Ala
20 25 30

His Leu Ala Ser Asp Phe Gly Val Arg Val Phe Gln Gln Val Ala Gln
35 40 45

Ala Ser Lys Asp Arg Asn Val Val Phe Ser Pro Tyr Gly Val Ala Ser
50 55 60

Val Leu Ala Met Leu Gln Leu Thr Thr Gly Gly Glu Thr Gln Gln Gln
65 70 75 80

Ile Gln Ala Ala Met Gly Phe Lys Ile Asp Asp Lys Gly Met Ala Pro
85 90 95

Ala Leu Arg His Leu Tyr Lys Glu Leu Met Gly Pro Trp Asn Lys Asp
100 105 110

Glu Ile Ser Thr Thr Asp Ala Ile Phe Val Gln Arg Asp Leu Lys Leu
115 120 125

Val Gln Gly Phe Met Pro His Phe Phe Arg Leu Phe Arg Ser Thr Val
130 135 140

Lys Gln Val Asp Phe Ser Glu Val Glu Arg Ala Arg Phe Ile Ile Asn
145 150 155 160

Asp Trp Val Lys Thr His Thr Lys Gly Met Ile Ser Asn Leu Leu Gly
165 170 175

Lys Gly Ala Val Asp Gln Leu Thr Arg Leu Val Leu Val Asn Ala Leu
180 185 190

Tyr Phe Asn Gly Gln Trp Lys Thr Pro Phe Pro Asp Ser Ser Thr His
195 200 205

Arg Arg Leu Phe His Lys Ser Asp Gly Ser Thr Val Ser Val Pro Met
210 215 220

Met Ala Gln Thr Asn Lys Phe Asn Tyr Thr Glu Phe Thr Thr Pro Asp
225 230 235 240

Gly His Tyr Tyr Asp Ile Leu Glu Leu Pro Tyr His Gly Asp Thr Leu
245 250 255

Ser Met Phe Ile Ala Ala Pro Tyr Glu Lys Glu Val Pro Leu Ser Ala
260 265 270

Leu Thr Asn Ile Leu Ser Ala Gln Leu Ile Ser His Trp Lys Gly Asn
275 280 285

Met Thr Arg Leu Pro Arg Leu Leu Val Leu Pro Lys Phe Ser Leu Glu
290 295 300

Thr Glu Val Asp Leu Arg Lys Pro Leu Glu Asn Leu Gly Met Thr Asp
305 310 315 320

Met Phe Arg Gln Phe Gln Ala Asp Phe Thr Ser Leu Ser Asp Gln Glu
325 330 335

Pro Leu His Val Ala Gln Ala Leu Gln Lys Val Lys Ile Glu Val Asn
340 345 350

Glu Ser Gly Thr Val Ala Ser Ser Ser Thr Ala Val Ile Val Ser Ala
355 360 365

Arg Met Ala Pro Glu Glu Ile Ile Met Asp Arg Pro Phe Leu Phe Val
370 375 380

Val Pro Pro Gln Lys Gln Cys Ala Trp Val Ile Leu Glu Cys Arg
385 390 395



<210> 38 
<211> 317 
<212> PRT

<213> Homo sapiens 
<400> 38

Met Thr Aid Ma Ser Met Gly Pro Val Arg Val Ala Phe Val Val Leu 
! i., 10 15 

Leu Ala. Leu Cys ^cr Arg Pro Ala Val Gly Gin Asn Cys Ser Gly Pro 
20 25 30 

Cys Arg Cys Pro Asp Glu Pro Ala Pro Arg Cys Fro Ala Gly Val Ser 
35 40 45 

Leu Val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gin Leu 
50 55 60 

Glv Glu Leu Cys Thr Glu Arg Asp Pro Cy$ Asp Pro His Lys Gly Leu 
65 70 75 80 

Phe Cys Asp Phe GXy Ser Pro Ala Asn Arg Lys He Gly Val Cys Thr 
35 ^0 35 

A3 a Lys Asp Gly Ala Pro Cys lie Fh® Gly Gly Thr Val Tyr Arg Ser 
100 105 HO 

uly Glu S<=r Phe Gin Ser Ser Cys Lys Tyr Gin Cys Thr Cys Leu Asp 
115 120 125 

Gly Ala Val Gly Cys Met Pro Leo Cys Ser Met Asp Val Arg Lsu Pro 
130 135 140 

Ser Pro Asp Cys Pro Leu Pro Leu Glu Asp Thr Phe Gly Pro Asp Pro 
145 " " 150 155 1-60 

Thr Mpt Hp Arq Ala Asn Cys Leu Val Gin Thr Thr Glu Trp Ser Ala 
1*55 170 175 

Cys Ser Lys Thr Cys Gly Met Gly lie Ser Thr Arg Val Thr Asn Asp 
' ISO 185 190 

Asn Ala f>er Cys Arg Leu Glu Lys Gin Ser Arg Leu Cys Met Val Arg 
195 200 205 

Pro Cys GJu Ser Asp Leu Glu Glu Asn II® Lys Lys Gly Lys Lys Cys 
2 iu 215 220 

lie Arq Thr Pro Lys lie Ser Lys Fro lie Lys Phe Glu Leu Ser Gly 
225 230 235 240 

Cys Thr S^r Met- Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr 
245 250 255 

Asp Gly Arg Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu 
260 265 270 

Phe Lys Cys Pro Asp Gly Glu Vy] Met LyS Lys Asn Met Met Phe lie 
z7 5 280 285 

Us Thr rya Ala CyS His Tyr Asn Cys Pro Gly Asp ASrt Asp He Phe 
230 295 300 

Glu Ser i.eu Tyr Xyr Arq Lys Met Tyr Gly Asp Met Ala 
305 310 315 
<210> 39
<212> PRT

<213> Homo sapiens
<400> 39

Asn Met Glu Asa Ser Leu Arg Cys Veil Trp Val Pre? Lys Leu Ala Phe 
1 5 XO 15 

Val Leu Pbe GJ y Ala Ser Lou Leu Ser Ala His Leu Gin Val Thr Gly 
20 25 30 

Phe Gin Tie Lys Ala Phe Thr Ala Leu Arg Pb® Leu Ser Glu Pro Ser 
35 40 45 

Asp Ala Val Thr Met Arg Gly Gly Asn Val Leu Leu Asp Cys Ser Ala 
50 55 60 

Glu Scr Asp Arg Gly Val Pro Val lie Lys Trp Lys Lys Asp Ala lie 
65 ~ 70 75 80 

His Leu Ala Leu Gly Met Asp Glu Arg Lys Gin Gin Leu Ser Asn Gly 
85 " ^0 95 

Ser Leu Leu lie Gin Asn lie Leu His Ser Arg His His Lys Pro Asp 

100 105 110 

Glu Gly Leu Tyr CJn Cys Glu Ala Ser Leu Gly Asp Ser Gly Ser lie 
Hfj 120 125 

lie Ser Arg Thr Ala Lys Val Ala Val Ala Gly Pro Leu Arg Phe Leu 
130 135 HO 

Ser Gin Thr Glu Ser Val Thr Ala Phe Met Gly Asp Thr Val Leu Leu 
145 150 155 160 

Lys Cys Glu V^i He Gly Glu Pro Met Pro Thr He His Trp Gin Lys 
165 170 3.75 

Asn Gin Gin Asp Leu Thr Pro He Pro Gly Asp Ser Arg Val Val Val 
180 185 190 

Leu Pro Ser Gly Ala Leu Gin He Ser Arg Leu Gin Pro Gly Asp He 
1.35 200 205 

ft] y He Tyr Arg Cys Ser Ala Arg A3n Pro Ala Ser Ser Arg Thr Gly 
210 " ?KS ' 220 

Asn Glu Ala Glu Val Arg lie Leu Ser Asp Pro Gly Lsu His Arg Gin 
225 230 235 240 

Leu Tyr Phe Leu Gin Arg fro Ser Asn val val Ala Xle Glu Gly Lys 
245 250 255 

Asp Ala Val Leu Glu Cys Cys Val Ser Gly Tyr Pro Pro Pro Ser Phe 
260 265 270 



Thr Trp Leu Arg Gly Glu Glu Val He Gin Leu Arg Ser Lys Lys Tyr 
275 " 230 285 
Ser Leu Leu Gly Gly Ser Asn Leu Leu Ile Ser Asn Val Thr Asp Asp
290 295 300 

Asp Ger Gly Met Tyr Thr Cys Val Val Thr Tyr Lys Asn GJ.u Asn lie 
305 310 315 320 

Ser Ala. Sex Ala Glu Leu Thr Val Leu Val lie lie Asp Lys Val Leu 
325 330 335 

Val Asp Thr Phe Trp Val 
340 



<210> 40

<211> 1433

<212> PRT

<213> Homo sapiens

<400> 40

Asn Met Glu Asn Ser Leu Arg Cys Val Trp Val Pro Lys Leu Ala Phe 
1 5 " 10 15 

Val Leu Phe Gly Ala Ser Leu Leu Ser Ala His Leu Gin Val Thr Gly 
20 25 30 

Phe Gin II j* Lys Ala Phc Thr Ala Leu Arg Phe Leu Ser Glu Pro Ser 
35 40 45 

Asp Ala Val Thr Met Arg Gly Gly Asn Val Leu Leu Asp Cys Ser Ala 
50 55 60 

Glu Ser Asp Arg Gly Val Pro Val lie Lys Trp Lys Lys Asp Ala lie 
65 70 75 80 

His Leu Ala Leu Gly Met Asp Glu Arg Lys Gin Gin Leu Ser Asn Gly 
35 ' 90 95 

Ser Leu Leu lie Glu Asn lie Leu His Ser Arg His His Lys Prg Asp 
100 105 HO 

Glu Gly Leu Tyr Gin Cys Glu Ala Ser Leu Gly Asp Ser Gly Ser He 
115 120 125 

He Ser Axg Thr Ala Lys Val Ala Val Ala Gly Pro Leu Arg Phe Leu 
130 135 140 

Ser Gin Thr Glu Ser Val Thr Ala Phe Met Gly Asp Thr Val Leu Leu 
J 45 150 155 160 

Lys Cys Glu Val I.le Gly Glu Pro Met Pro Thr He His Trp Gin Lys 
165 170 175 

Asn Gin Gin Asp Leu Thr Pro He Pro Gly Asp Ser Arg Val Val Val 
180 185 ' 190 

Leu Pro Ser Gly Ala Leu Gin He Ser Arg Leu Gin Pro Gly A3p He 
195 200 205 



Gly He Tyr Arg Cys .Ser Ala Arg Asn Pro Ala Ser Ser Arg Thr Gly 
210 " 215 220 
Asn Glu Ala Glu Val Arg Ile Leu Ser Asp Pro Gly Leu His Arg Gln
225 230 235 240 

Leu Tyr Phe Leu Gin Arg Pro Ser Asn Val Val Ala lie Glu Gly Lys 
245 250 255 

Asp Ala Vn J Leu Glu Cys Cys Val Ser Gly Tyr Pro Pro Pro Ser Phe 
260 265 270 

Thr Trp Leu Arg Gly Glu Glu Val lis Gin Leu Arg Ser Lys Lys Tyr 
275 280 2S5 

Ser Leu L<?u Gly Gly Ser Asn Leu Leu lie Ser Asn Val Thr Asp Asp 
290 295 300 

Asp Ser Gly Mel: Tyr Thr Cys Val Val Thr Tyr Lys Asn Glu Asn lie 
305 310 31G 320 

Ser Ala Ser Ala Glu Leu Thr Val Leu Val Pro Pro Trp Phe Leu Asn 
325 330 335 

His Pro Ser Asn Leu Tyr Aid Tyr Glu Ser Met Asp lie Glu Phe Glu 
340 345 350 

Cys Thr Val Ser Gly Lys Pro Val Pro Thr Val Asn Trp Met Lys Asn 
355 " 360 355 

Gly Asp Val Val lie Pro Ser Asp Tyr Phe Gin lie Val Gly Gly Ser 
370 375 380 

Asn Leu Arg He Leu Gly Val Val Lys Ser Asp Glu Gly Phe Tyr Gin 
385 .*90 395 400 

Cys Val Ala Glu Asn Glu Ala Gly Asn Ala Gin Thr Ser Ala Gin Leu 
405 410 415 

He Val Pro Lys Pro Ala He Pro Ser Ser Ser Val Leu Pro Ser Ala 
420 425 430 

Pro Arg Asp val Val Pro Val Leu Val Ser Ser Arg Pne Val Arg Leu 
43i; 440 445 

Ser Trp Arg Pro Pro Ala Glu Ala Lys Gly Asn He Gin Thr Phe Thr 
4160 455 460 

Val Fhe Phe Ser Arg Glu Gly Asp Asn Arg Glu Arg Ala Leu Asn Thr 
465 * 470 " 475 480 

Thr Gin Pro Gly Ser Leu Gin Leu Thr Val Gly Asn Leu Lys Pro Glu 
48b 430 495 

Ala Met Tyr Thr Phe Arg Val Val Ala Tyr Asn Glu Trp Gly Pro Gly 
500 505 510 

Glu Ser Sor Gin Pro He Lys Val Ala Thr Gin Pro Glu Leu Gin Val 
515 520 525 



Pro Gly Pro Veil Glu Asn Leu Gin Ala Val Ser Thr Ser Pro Thr Ser 

530 ^35 540 
Ile Leu Leu Thr Trp Glu Pro Pro Ala Tyr Ala Asn Gly Pro Val Gln
545 550 555 560 

Gly Tyr Arq Leu PVie Cys Thr Glu Val Ser Thr Gly Lys Glu Gin Asn 
565 S70 575 

lie Glu Val Asp Gly Lou Ser Tyr Lys Leu Glu Gly Leu Lys Lys Phe 
5f30 585 590 

Thr Glu Tyr Ser Leu Arg Phe Leu Ala Tyr Asn Arg Tyr Gly Pro Gly 
595 600 605 

Val Ser Thr Asp Asp He Thr Val. Val Thr Leu Ser Asp Val Pro Ser 
610 620 

Ala Pro Pro Gin Asn Val Ser Leu Glu Val Val Asn Ser Arg Ser He 
625 630 635 640 

Lys Val Ser Trp Leu Pro Pro Pro Ser Gly Thr Gin Asn Gly Phe lie 
645 650 655 

Thr Gly Tyr Lys He Arg His Arg Lys Thr Thr Arg Arg Gly Glu Met 
6(50 ^65 670 

Glu Thr Leu Glu Pio Asn Asn Leu Trp Tyr Leu Phe Thr Gly Glu 
675 680 635 

Lys Gly Ser Gin Tyr S*=r Phe Gin Val Ser Ala Met Thr Val Asn Gly 
690 " 695 700 

Thr Gly Pro Pro Ser Asn Trp Tyr Thr Ala Glu Thr Pro GlU Asn Asp 
705 710 715 720 

Leu Asp GJu Ser Gin Val Pro Asp Gin Pro Ser Ser Leu His Val Arg 
725 730 735 

Pro GJn Thr Asn Cys lie He Met Ser Trp Thr Pro Pro Leu Asn Pro 
740 745 750 

Asn He Veil Vol Arg Gly Tyr He He Gly Tyr Gly Val Gly Ser Pro 
7 !S5 760 765 

Tyr AJa Glu Thr Val Arg Val Asp Ser Lys Gin Arg Tyr Tyr Ser He 
770 775 790 

Glu Arg Leu Glu Ser Scr Ser His Tyr Val He Ser Leu Lys Ala Phe 
735 730 795 800 

A*n Asn Ala W y Glu Gly Val Pro Leu Tyr Glu Ser Ala Thr Thr Arg 
U0b 310 815 

Sor lie Thr Asp Pro Thr Asp Pro Val Asp Tyr Tyr Pro Leu Leu Asp 
320 32b 830 

Asp Ph^ Pro Tbr 3*r Val Pro Asp Leu Ser Thr Pro Met Leu Pro Pro 
835 840 345 

Val Gly Val Gin Ala Val Aid Leu Thr His Asp Ala Val Arg Val Ser 
350 855 360 

Trp Ala Asp Asn Ser Val Pro Lys Asn Gln Lys Thr Ser Glu Val Arg
865 870 875 880

Leu Tyr Thr Val Arg Trp Arg Thr Ser Pne Ser Ala Ser Ala Lys Tyr 
885 890 895 

Lys Ser Glu Asp Thr Thr Ser Lou Ser Tyr Thr Ala Thr Gly Leu Lys 
900 905 910 

Pro Asn Thr Met Tyr Glu Phe Ser Val Met Val Thr Lys Asn Arg Arg 
f U!3 920 925 

Ser Ser Thr Trp Ser Met Thr Ala His Ala Thr Thr Tyr Glu Ala Ala 
330 935 940 

Pro Thr Ser Ala Pro Lys Asp Phe Thr Val lie Thr Arg Glu Gly Lys 
945 950 955 960 

Pro Arg Ala Val lie Val Ser Trp Gin Pro Pro Leu Glu Ala Asn Gly 
965 97Q 975 

Lys lie Thr Ala Tyr lie Leu Phe Tyr Thr Leu Asp Lys Asn He Pro 
980 935 990 

He* Asp Asp Trp I'l.e Met Glu Thr He Ser Gly Asp Arg Leu Thr His 
995 1000 1005 

Gin lie Met Asp L^u Asn Leu Asp Thr Met Tyr Tyr Phe Arg lie Gin 
1010 1015 1020 

Ala Arg Asn Ser Lys Gly Val Gly Pro Leu Ser Asp Pro lie Leu J?be 
1025 1030 1035 1040 

Arg Thr Leu Lys Vdl Glu His Pro Asp Lys Met Ala Asn Asp Gin Gly 
KM5 "l050 1055 

Arg His CJ y Asp GJ y Gly Tyr Trp Pro Val Asp Thr Asn Leu lie Asp 
1060 1065 1070 

Arg Scr Thr Lou Asn Glu Pro Pro He Gly Gin Met Hi3 Pro Pro His 
1075 1080 1035 

Gly Ser VaJ, Thr Pro G.) n Lys Asn Ser Asn Leu Leu Val He He Val 
1090 1095 1100 

Val Thr Val Gly Val lie Thr Val Leu Val Val Val He Val Ala Val 
1105 1110 1115 1120 

lie Cys Thr Arg Arg Scr Scr Ala Gin Gin Arg Lys LyS Arg Ala Thr 
1125 H30 1135 

His Ser Als Gly Lys Arg Lys Gly Ser Gin Lys Asp Leu Arg Pro Pro 
1140 1145 1150 

Asp Leu Trp He Ills His Glu Glu Met Glu Met Lys Asn. He Glu Lys 
1155 1160 1165 

Pro Ser Gly Thr Asp Pro Ala Gly Arg Asp Ser Pro He Gin Ser Cys 
1170 1175 1180 



Gin Asp Leu Thr Pro Val Ser His Ser Gin Ser Glu Thr Gin Leu Gly 
1185 1190 1195 1200 
Ser Lys Ser Thr Ser His Ser Gly Gln Asp Thr Glu Glu Ala Gly Ser
1205 1210 1215 

3er Met Ser Thr Leu Glu Arg Ser Leu Ala Aid Arg Arg Ala Pro Arg 
1220 " 1225 1230 

Ala Lys Leu Met He Pro Met Asp Ala Gin Ser Asn Asn Pro Ala Val 
1235 1240 1245 

Val Ser Ala He Pro Val Pro Thr leu Glu Ser Ala Gin Tyr Pro Gly 
1250 1255 1260 

11$ Leu Pro Her Pro Thr Cy& Gly Tyr Pro His Pro Gin Phe Thr Leu 
1265 1270 ' " 1275 1280 

Arg Pro Val Pro Phe Pro Thr Leu Ser Val Asp Arg Gly Phe Gly Ala 
12»ii 1290 1295 

Gly Arg Ser Gin Ser Val Ser Glu Gly Pro Thr Thr Gin Gin Pro Pro 
1300 1305 1310 

Met Leu Pro Pro Ser Gin Pro Glu His Ser Ser Ser Glu Glu Ala Pro 
1315 1320 1325 

Ser Arg Thr He Pro Thr Ala Cys Val Arg Pro Thr His Pro Leu Arg 
1330 1335 1340 

Sor Phe Ala Asn Pro Leu Leu Pro Pro Pro Met Ser Ala He Glu Pro 
1345 1350 1355 1360 

Lvs Val Pro Tyr Thr Pro Leu Leu Ser Gin Pro Gly Pro Thr Leu Pro 
1365 1370 1375 

Lys Thr His v a ] Lys Thr Ala Ser Leu Gly Leu Ala Gly Lys Ala Arg 
1380 1335 1330 

Ser Pro Leu Leu Pro val Ser Val Pro Thr Ala Pro Glu Val Ser Glu 
3 395 1400 1405 

Glu Ser ilia Lys Pro Thr Glu Asp Ser Ala Asn Val S©r Ala Ser Leu 
14J0 1415 1420 

Lys Phe Met Leu His Gin Gly Thr Asp 
1425 14 30 



<210> 41 
<211> 865
<212> PRT

<213> Homo sapiens 
<400> 41 

Met Pro Gly Lys Arg Gly Leu Gly Trp Trp Trp Ala Arg Leu Pro Leu 
1 5 10 15 

Cys Leu Leu Leu Ser Leu Tyr Gly Pro Trp Met Pro Ser Ser Leu Gly 
20 ' 25 30 



Lys Pro Lys Gly His Pro His Met Asn Ser lie Arg He Asp Gly Asp 
35 40 45 
Ile Thr Leu Gly Gly Leu Phe Pro Val His Gly Arg Gly Ser Glu Gly
50 55 60 

Lys Pro Cyn Gly Glu Leu Lys Lys Glu Lys Gly I Is His Arg Leu Glu 
65 ' 70 75 30 

Ala Met Leo Phe Ala Leu Asp Arg He Asn Asn Asp Pro Asp Leu Leu 
85 90 95 

Pro Asn lie Thr Leu Gly Ala Arg He Leu Asp Thr Cys Ser Arg Asp 
100 105 110 

Thr His Ala Leu Glu Gin Ser Leu Thr Pbe Val Gin Ala Leu He Glu 
115 120 125 

Lys Asp Gly Thr Glu Val Arg Cys Gly Ser Gly Gly Pro Pro He He 
130 135 140 

Thr Lys Pro Glu Arg Val Val Gly Val He Gly Ala Ser Gly Ser Ser 
145 150 155 160 

Val Sfir He Met Val Ala Asn He Leu Arg Leu Phe Lys He Pro Gin 
165 170 175 

He Ser Tyr Ala Ser Thr Ala Pro Asp Leu Ser Asp Asn Ser Arg Tyr 
100 185 190 

Asp Phe Phe Ser Arg Val Val Pro Ser Asp Thr Tyr Gin Ala Gin Ala 
19b * 200 205 

Mat Val Asp He Vel Arg Ala Leu Lys Trp Asn Tyr Val Ser Thr Val 
210 215 220 

Ala Ser Glu Gly Ser Tyr GJy Glu Ser Gly Val Glu Ala Phe He Gin 
225 ' 230 235 240 

Lys Ser Arg Glu Asp Gly Gly Val Cys He Ala Gin Ser Val Lys He 
24 5 250 255 

Pro Arg Glu Pro Lys Ala Gly Glu Phe Asp Lys He He Arg Arg Leu 
260 ' 265 270 

Leu Gin Thr Ser Asn Ala Arq Ala Val He He Phe Ala Asn Glu Asp 
275 ' 230 285 

Asp He Arg Arg VaJ Leu Glu Ala Ala Arg Arg Ala Asn Gin Thr Gly 
290 " 295 300 

His Phe Phe Trp Met Gly Ser Asp Ser Trp Gly Ser Lys He Ala Pro 
505 310 315 320 

Val Leu His T&u Glu Glu Val Ala Glu Gly Ala Val Thr He Leu Pro 
325 330 335 

Lys Arg Me'!. Scr Val Arg Asp Arg Glu Arg He Gly Gin Asp Ser Ala 
3AH 345 350 

Tyr Glu «] n Glu Gly Lys Val Gin Phe Val He Asp Ala Val Tyr Ala 
355 360 365 
Met Gly His Ala Leu His Ala Met His Arg Asp Leu Cys Pro Gly Arg
370 375 380

Val Gly Leu Cys Pro Arg Mot Asp Pro Val. Asp Gly Thr Gin Leu Leu 
385 390 395 400 

Lys Tyr 11^ Arg Asn v^l Asn Phe Ser Gly lie Ala Gly Asn Pro Val 
401) 410 415 

Thr Phe Asn Glu Asn Gly Asp Ala Pro Gly Arg Tyr Asp lie Tyr Gin 
420 425 4 30 

Tyr Gin Lea Arg Asn Asp Sar Ala Glu Tyr Lys Val He Gly Ser Trp 
43b 440 445 

Thr Asp His Leu His Leu Arg He Glu Arg Met His Trp Pro Gly S<sr 
450 4 5 5 4 60 

Gly Gin Gin Leu Pro Arg Ser lie Cys Ser Leu Pro Cys Gin Pro Gly 
465 00 475 430 

Glu Arg Lys Ly.7 Thr Val Lys Gly Met Pro Cys Cys Trp His Cys Glu 
485 490 495 

Pro Cys Thr Gly Tyr Gin Tyr Gin Val Asp Arg Tyr Thr Cys Lys Thr 
500 505 510 

Cys Pro Tyr Asp Met Arg Pro Thr Glu Asn Arg Thr Gly Cys Arg Pro 
515 520 $25 

Tie Pro He He Lys Leu Glu Trp Gly Ser Pro Trp Ala Val Leu Pro 
530 535 ' 540 

Leu Phe Lftu Ala Val Val Gly lie Ala Ala Thr Leu Phe Val Val He 
545 550 555 560 

Thr Phe Val Arg Tyr Asn Asp Thr Pro He Val Lys Ala Ser Gly Arg 
565 570 575 

Glu Leu Sqt Tyr Val Leu Leu Ala Gly He Phe Leu Cys Tyr Ala Thr 
530 585 590 

Thr Phe T.eu Met He Ala Glu Pro Asp Leu Gly Thr Cys Ser Leu Arg 
595 600 605 

Arg lie Phe Leu G.l y Leu Gly Met Ser He Ser Tyr Ala Ala Leu Leu 
63 0 615 620 

Thr Lys Thr Aon Arg He Tyr Arg He Phe Glu Gin Gly Lys Arg Ser 
625 " 630 $35 640 

Val Ser Ala Pro Arcj Phe He Ser Pro Ala Ser Gin Leu Ala He Thr 
Gib 650 655 

Phe Ser Leu Ho Ser Leu Gin Leu Leu Gly He Cys Val Trp Phe Val 
f>60 605 670 

Val Asp Pro 5«r His Ser Val Val Asp Phe Gin Asp Gin Arg Thr Leu 
67 3 630 $8 5 



Asp Pro Arg Phe Ala Arg Gly Val Leu Lys Cys Asp Ile Ser Asp Leu
690 695 700

Ser Leu lie Cys Leu Lea Gly Tyr Ser Met Leu Leu Met Val Thr Cys 
705 710 715 720 

Thr Val Tyr Ala lie Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu 
725 730 735 

Ala Lys Pro T Le Gly Phe Thr Met Tyr Thr Thr Cys He Val Trp Leu 
740 745 750 

Ala Phe lie Pro lie Phe Phc Gly Thr Ser Gin Ser Ala Asp Lys Leu 
7 55 7 60 7 65 

Tyr He Gin Thr Thr Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser 
770 775 780 

Val Ssr Leu Gly Met Leu Tyr Met Pro Ly3 Val Tyr He He Leu Phe 
785 790 795 800 

His Pro Glu Gin Asn Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val 
805 310 815 

Val Thr Ala Ala Thr Mel: Ser Asn Lys Phe Thr Gin Lys Gly Asn Phe 
820 825 330 

Arg Pro Asn Gly G Lu Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Ala 
335 34Q S45 

Pro Ala Leu Ala Thr Lys Gin Thr Tyr Val Thr Tyr Thr Asn His Ala 
850 355 860 

Ile
865



<210> 42
<211> 845 
<212> PRT 

<213> Homo sapiens 
<400> 42 

Met Glu Thr Lys Gly Tyr His Ser Leu Pro Glu Gly Leu Asp Met Glu 
1 5 10 15 

Arg Arg Trp Gly Gin Val Ser Gin Ala Val Glu Arg Ser Ser Leu Gly 
20 25 30 

Pro Thr Glu Arg Thr Asp Glu Asn Asn Tyr Met Glu He Val A3n Val 
35 " 40 45 

Ser Cys Val Ser Gly A!o He Pro Asn Asn Ser Thr Gin Gly Ser Ser 
50 -55 60 

Lys Glu Lys Gin Glu Lsu Leu Pro Cys Leu Gin Gin Asp Asn Asn Arg 
65 70 75 80 

Pro Gly lie Leu Thr 3<sr Asp Tie Lys Thr Glu Leu Glu Ser Lys Glu 
8£ SO 95 



Leu Ser Ala Thr Val Ala Glu Ser Met Gly Leu Tyr Met Asp Ser Val
100 105 110

Arg Asp Ala Asp Tyx s<=r Tyr Glu Gin Gin Asn Gin Gin Gly Ser Met 
115 120 125 

Ser Fro Ala Lys II Tyr Gin Asn Val Glu Gin Leu Val Lys Phe Tyr 
130 135 140 

Lys Gly Ann Gly fiis Aug Pre? Ser Thr Leu Ser Cys Val Asn Thr Pro 
145 150 155 160 

Leu Arg Ser Phe Mel Scr Asp Ser Gly Ser Ser Val Asn Gly Gly Val 
165 170 175 

Met Arg Ala lie Val Lys Ser Pro 11© Met Cys His Glu Lys Ser Pro 
180 185 190 

Ser Val Cys Ser Pro Leu Asn Met Thr Ser Ser Val Cys Ser Pro Ala 
195 200 205 

Gly He Asn Ser Val Ser Ser Thr Thi Ala Ser Phe Gly 3<sr Phe Pro 
210 215 220 

Val His Ser Pro Tie Thr Gin Gly Thr Pro Leu Thr Cys Ser Pro Asn 
225 230 235 240 

A) a Glu Asn Arg Gly Ser Arg Ser His Ser Pro Ala His Ala Ser Asn 
245 250 255 

Val Gly Ser Pre Leu Ser Ser Pro Leu Ser Ser Met Lys Ser Ser lie 
260 265 210 

Ser S^r Pro Pm Ser His Cys Ser Val Lys Ser Pro Val Ser Ser Pro 
275 280 285 

Asn Asn Val Thr Leu Arg Ser Ser Val Ser Ser Pro Ala Asn He Asn 
290 295 300 

Asn Sw Arg Cys Ser Val Ser Ser Pro Ser Asn Thr Asn Asn Arg Ser 
305 ' 310 315 320 

Thr Leu S*r Ser Pro Ala Ala Ser Thr Val Gly Ser He Cys Ser Pro 
325 330 335 

Val Asn Asn Ala Phe Ser Tyr Thr Ala Ser Gly Thr Ser Ala Gly Ser 
34 0 345 350 

Ser Thr Leu Arg Asp Val Val Pro Ser Pro Asp Thr Gin Glu Lys Gly 

' 355 360 365 

Ala Gin Glu Val Pro Phe Pro Lys Thr Glu Glu Val Glu Ser Ala He 
370 375 380 

Ser Asn G*J y Val Thr Gly Gin Leu Asn He Val Gin Tyr He Lys Pro 
385 390 395 400 

Glu Pro Asp Gly Ala Phe Ser Sor Ser Cys Leu Gly Gly Asn Ser Lys 
405 410 415 

He Ash Sor Asp Ser Ser Phe Ser Val Pro He Lys Gin Glu Ser Thr 
420 425 430 
Lys His Ser Cys Ser Gly Thr Ser Phe Lys Gly Asn Pro Thr Val Asn
435 440 445 

Pro Phe Pro Phe Met Asp Gly Ser Tyr Phe S*r Phe Met Asp Asp Lys 
4S>0 455 460 

Asp Tyr Tyr Sor Leu Ser Gly He Leu Gly Pro Pro Vai Pro Gly Ph* 

470 475 480 

Asp Gly Ami Cys Git; Gly Ser Gly Phe Pro Val Gly He Lys Gin Glu 
485 490 495 

Pro Asp Asp Gly Gcr Tyr Tyr Pro Glu Ala Set He Pro Ser Ser Ala 
.400 50b 510 



He Val Gly Val Asn Ser Gly Gly Gin Ser Phe His Tyr Arg He Gly 
515 520 525 



Ala Gin Gly Thr He Ser Leu Ser Arg Ser Ala Arg Asp Gin Ser Phe 
■ l j30 535 5*3 0 

Gin His Leu Ser Ser phe Pro Pro Val Asn Thr Leu Val Glu Ser Trp 
545 550 555 560 

Lys Ser His Gly Asp Leu Ser Ser Arg Arg Ser Asp Gly Tyr Pro Val 
565 570 ' 575 

Leu Glu Ty* He Pro Glu Asn Val Ser Ser Ser Thr Leu Arg Ser Val 
500 585 590 

Ser Thr Gly Ser Ser Arg Pro Ser Lys He Cys Leu Val Cys Gly Asp 
591? 600 605 

Glu Ala Ser Gly Cys His Tyr Gly Val Val Thr Cys Gly Ser Cys Lys 
610 615 620 

Val Phe Phe Lys Arg Ala Val Glu Gly Gin His Asn Tyr Leu Cys Ala 
625 630 635 640 

Gly Arg Asn Asp Cys He lie Asp Lys He Arg Arg Lys Asn Cys Pro 
645 650 655 

Ala Cys Arq Leu Gin Lys Cys Leu Gin Ala Gly Met Asn Leu Gly Ala 
660 665 " 670 

Arg Lys Ser Lys Lys Leu Gly Lys Leu Lys Gly He His Glu Glu Gin 
675 680 685 

Pro Gin Gin Gin Gin Pro Pro Pro Pro Pro Pro Pro Pro Gin Ser Pro 
690 695 700 

Glu Glu Gly Thr Thr Tyr He Ala Pro Ala Lys Glu Pro Ser Val Asn 
705 710 715 720 

Thr Ala Leu Val Pro Gin Leu Ser Thr He Ser Arg Ala Leu Thr Pro 
725 730 735 

Ser Pro Val Mot Val Leu Glu Asn He Glu Pro Glu He Val Tyr Ala 
740 745 750 
Gly Tyr Asp Ser Ser Lys Pro Asp Thr Ala Glu Asn Leu Leu Ser Thr
755 760 765

Leu Asn Arg Leu Ala Gly Lys Gln Met Ile Gln Val Val Lys Trp Ala
770 775 780

Lys Val Leu Pro Gly Phe Lys Asn Leu Pro Leu Glu Asp Gln Ile Thr
785 790 795 800

Leu Ile Gln Tyr Ser Trp Met Cys Leu Ser Ser Phe Ala Leu Ser Trp
805 810 815

Arg Ser Tyr Lys His Thr Asn Ser Gln Phe Leu Tyr Phe Ala Pro Asp
820 825 830

Leu Val Phe Asn Glu Leu Leu Ala Arg Val Arg Glu Gly
835 840 845



<210> 43 
<211> 837 
<212> PRT 

<213> Homo sapiens 
<4Q0> 43 

Met Glu Thr Lys Gly Tyr His Ser Leu Pro Glu Gly Leu Asp Met Glu 
1 5 10 15 

Arg Arg Trp Gly Gln Val Ser Gln Ala Val Glu Arg Ser Ser Leu Gly 
20 25 30 

Pro Thr Glu Arg Thr Asp Glu Asn Asn Tyr Met Glu Ile Val Asn Val 
35 40 45 

Ser Cys Val Ser Gly Ala Ile Pro Asn Asn Ser Thr Gln Gly Ser Ser 
50 55 60 

Lys Glu Lys Gln Glu Leu Leu Pro Cys Leu Gln Gln Asp Asn Asn Arg 
65 70 75 80 

Pro Gly Ile Leu Thr Ser Asp Ile Lys Thr Glu Leu Glu Ser Lys Glu 
85 90 95 

Leu Ser Ala Thr Val Ala Glu Ser Met Gly Leu Tyr Met Asp Ser Val 
100 105 110 

Arg Asp Ala Asp Tyr Ser Tyr Glu Gln Gln Asn Gln Gln Gly Ser Met 
115 120 125 

Ser Pro Ala Lys Ile Tyr Gln Asn Val Glu Gln Leu Val Lys Phe Tyr 
130 135 140 

Lys Gly Asn Gly His Arg Pro Ser Thr Leu Ser Cys Val Asn Thr Pro 
145 150 155 160 

Leu Arg Ser Phe Met Ser Asp Ser Gly Ser Ser Val Asn Gly Gly Val 
165 170 175 

Met Arg Ala Ile Val Lys Ser Pro Ile Met Cys His Glu Lys Ser Pro 
180 185 190 

Ser Val Cys Ser Pro Leu Asn Met Thr Ser Ser Val Cys Ser Pro Ala 
195 200 205 

Gly Ile Asn Ser Val Ser Ser Thr Thr Ala Ser Phe Gly Ser Phe Pro 
210 215 220 

Val His Ser Pro Ile Thr Gln Gly Thr Pro Leu Thr Cys Ser Pro Asn 
225 230 235 240 

Ala Glu Asn Arg Gly Ser Arg Ser His Ser Pro Ala His Ala Ser Asn 
245 250 255 

Val Gly Ser Pro Leu Ser Ser Pro Leu Ser Ser Met Lys Ser Ser Ile 
260 265 270 

Ser Ser Pro Pro Ser His Cys Ser Val Lys Ser Pro Val Ser Ser Pro 
275 280 285 

Asn Asn Val Thr Leu Arg Ser Ser Val Ser Ser Pro Ala Asn Ile Asn 
290 295 300 

Asn Ser Arg Cys Ser Val Ser Ser Pro Ser Asn Thr Asn Asn Arg Ser 
305 310 315 320 

Thr Leu Ser Ser Pro Ala Ala Ser Thr Val Gly Ser Ile Cys Ser Pro 
325 330 335 

Val Asn Asn Ala Phe Ser Tyr Thr Ala Ser Gly Thr Ser Ala Gly Ser 
340 345 350 

Ser Thr Leu Arg Asp Val Val Pro Ser Pro Asp Thr Gln Glu Lys Gly 
355 360 365 

Ala Gln Glu Val Pro Phe Pro Lys Thr Glu Glu Val Glu Ser Ala Ile 
370 375 380 

Ser Asn Gly Val Thr Gly Gln Leu Asn Ile Val Gln Tyr Ile Lys Pro 
385 390 395 400 

Glu Pro Asp Gly Ala Phe Ser Ser Ser Cys Leu Gly Gly Asn Ser Lys 
405 410 415 

Ile Asn Ser Asp Ser Ser Phe Ser Val Pro Ile Lys Gln Glu Ser Thr 
420 425 430 

Lys His Ser Cys Ser Gly Thr Ser Phe Lys Gly Asn Pro Thr Val Asn 
435 440 445 

Pro Phe Pro Phe Met Asp Gly Ser Tyr Phe Ser Phe Met Asp Asp Lys 
450 455 460 

Asp Tyr Tyr Ser Leu Ser Gly Ile Leu Gly Pro Pro Val Pro Gly Phe 
465 470 475 480 

Asp Gly Asn Cys Glu Gly Ser Gly Phe Pro Val Gly Ile Lys Gln Glu 
485 490 495 

Pro Asp Asp Gly Ser Tyr Tyr Pro Glu Ala Ser Ile Pro Ser Ser Ala 
500 505 510 

Ile Val Gly Val Asn Ser Gly Gly Gln Ser Phe His Tyr Arg Ile Gly 
515 520 525 

Ala Gln Gly Thr Ile Ser Leu Ser Arg Ser Ala Arg Asp Gln Ser Phe 
530 535 540 

Gln His Leu Ser Ser Phe Pro Pro Val Asn Thr Leu Val Glu Ser Trp 
545 550 555 560 

Lys Ser His Gly Asp Leu Ser Ser Arg Arg Ser Asp Gly Tyr Pro Val 
565 570 575 

Leu Glu Tyr Ile Pro Glu Asn Val Ser Ser Ser Thr Leu Arg Ser Val 
580 585 590 

Ser Thr Gly Ser Ser Arg Pro Ser Lys Ile Cys Leu Val Cys Gly Asp 
595 600 605 

Glu Ala Ser Gly Cys His Tyr Gly Val Val Thr Cys Gly Ser Cys Lys 
610 615 620 

Val Phe Phe Lys Arg Ala Val Glu Gly Gln His Asn Tyr Leu Cys Ala 
625 630 635 640 

Gly Arg Asn Asp Cys Ile Ile Asp Lys Ile Arg Arg Lys Asn Cys Pro 
645 650 655 

Ala Cys Arg Leu Gln Lys Cys Leu Gln Ala Gly Met Asn Leu Gly Ala 
660 665 670 

Arg Lys Ser Lys Lys Leu Gly Lys Leu Lys Gly Ile His Glu Glu Gln 
675 680 685 

Pro Gln Gln Gln Gln Pro Pro Pro Pro Pro Pro Pro Pro Gln Ser Pro 
690 695 700 

Glu Glu Gly Thr Thr Tyr Ile Ala Pro Ala Lys Glu Pro Ser Val Asn 
705 710 715 720 

Thr Ala Leu Val Pro Gln Leu Ser Thr Ile Ser Arg Ala Leu Thr Pro 
725 730 735 

Ser Pro Val Met Val Leu Glu Asn Ile Glu Pro Glu Ile Val Tyr Ala 
740 745 750 

Gly Tyr Asp Ser Ser Lys Pro Asp Thr Ala Glu Asn Leu Leu Ser Thr 
755 760 765 

Leu Asn Arg Leu Ala Gly Lys Gln Met Ile Gln Val Val Lys Trp Ala 
770 775 780 

Lys Val Leu Pro Gly Phe Lys Asn Leu Pro Leu Glu Asp Gln Ile Thr 
785 790 795 800 

Leu Ile Gln Tyr Ser Trp Met Cys Leu Ser Ser Phe Ala Leu Ser Trp 
805 810 815 

Arg Ser Tyr Lys His Thr Asn Ser Gln Phe Leu Tyr Phe Ala Pro Asp 
820 825 830 

Leu Val Phe Asn Glu 
835 

<210> 44 

<211> 640 

<212> PRT 

<213> Homo sapiens 

<400> 44 

Met Gly Arg Leu Gin Leu Val Val Leu Gly Leu Thr Cys Cys Trp Ala 
1 5 10 15 

Val Ala Ser Ala Ala Lys Leu Gly Ala Val Tyr Thr Glu Gly Gly Phe 
20 25 30 

Val Glu Gly Val Asn Lys Lys Leu Gly Leu Leu Gly Asp Ser Val Asp 
35 40 45 

Ile Phe Lys Gly Ile Pro Phe Ala Ala Pro Thr Lys Ala Leu Glu Asn 
50 55 60 

Pro Gln Pro His Pro Gly Trp Gln Gly Thr Leu Lys Ala Lys Asn Phe 
65 70 75 80 

Lys Lys Arg Cys Leu Gln Ala Thr Ile Thr Gln Asp Ser Thr Tyr Gly 
85 90 95 

Asp Glu Asp Cys Leu Tyr Leu Asn Ile Trp Val Pro Gln Gly Arg Lys 
100 105 110 

Gln Val Ser Arg Asp Leu Pro Val Met Ile Trp Ile Tyr Gly Gly Ala 
115 120 125 

Phe Leu Met Gly Ser Gly His Gly Ala Asn Phe Leu Asn Asn Tyr Leu 
130 135 140 

Tyr Asp Gly Glu Glu Ile Ala Thr Arg Gly Asn Val Ile Val Val Thr 
145 150 155 160 

Phe Asn Tyr Arg Val Gly Pro Leu Gly Phe Leu Ser Thr Gly Asp Ala 
165 170 175 

Asn Leu Pro Gly Asn Tyr Gly Leu Arg Asp Gln His Met Ala Ile Ala 
180 185 190 

Trp Val Lys Arg Asn Ile Ala Ala Phe Gly Gly Asp Pro Asn Asn Ile 
195 200 205 

Thr Leu Phe Gly Glu Ser Ala Gly Gly Ala Ser Val Ser Leu Gln Thr 
210 215 220 

Leu Ser Pro Tyr Asn Lys Gly Leu Ile Arg Arg Ala Ile Ser Gln Ser 
225 230 235 240 

Gly Val Ala Leu Ser Pro Trp Val Ile Gln Lys Asn Pro Leu Phe Trp 
245 250 255 

Ala Lys Lys Val Ala Glu Lys Val Gly Cys Pro Val Gly Asp Ala Ala 
260 265 270 



Arg Met Ala Gln Cys Leu Lys Val Thr Asp Pro Arg Ala Leu Thr Leu 
275 280 285 

Ala Tyr Lys Val Pro Leu Ala Gly Leu Glu Tyr Pro Met Leu His Tyr 
290 295 300 

Val Gly Phe Val Pro Val Ile Asp Gly Asp Phe Ile Pro Ala Asp Pro 
305 310 315 320 

Ile Asn Leu Tyr Ala Asn Ala Ala Asp Ile Asp Tyr Ile Ala Gly Thr 
325 330 335 

Asn Asn Met Asp Gly His Ile Phe Ala Ser Ile Asp Met Pro Ala Ile 
340 345 350 

Asn Lys Gly Asn Lys Lys Val Thr Glu Glu Asp Phe Tyr Lys Leu Val 
355 360 365 

Ser Glu Phe Thr Ile Thr Lys Gly Leu Arg Gly Ala Lys Thr Thr Phe 
370 375 380 

Asp Val Tyr Thr Glu Ser Trp Ala Gln Asp Pro Ser Gln Glu Asn Lys 
385 390 395 400 

Lys Lys Thr Val Val Asp Phe Glu Thr Asp Val Leu Phe Leu Val Pro 
405 410 415 

Thr Glu Ile Ala Leu Ala Gln His Arg Ala Asn Ala Lys Ser Ala Lys 
420 425 430 

Thr Tyr Ala Tyr Leu Phe Ser His Pro Ser Arg Met Pro Val Tyr Pro 
435 440 445 

Lys Trp Val Gly Ala Asp His Ala Asp Asp Ile Gln Tyr Val Phe Gly 
450 455 460 

Lys Pro Phe Ala Thr Pro Thr Gly Tyr Arg Pro Gln Asp Arg Thr Val 
465 470 475 480 

Ser Lys Ala Met Ile Ala Tyr Trp Thr Asn Phe Ala Lys Thr Gly Asp 
485 490 495 

Pro Asn Met Gly Asp Ser Ala Val Pro Thr His Trp Glu Pro Tyr Thr 
500 505 510 

Thr Glu Asn Ser Gly Tyr Leu Glu Ile Thr Lys Lys Met Gly Ser Ser 
515 520 525 

Ser Met Lys Arg Ser Leu Arg Thr Asn Phe Leu Arg Tyr Trp Thr Leu 
530 535 540 

Thr Tyr Leu Ala Leu Pro Thr Val Thr Asp Gln Glu Ala Thr Pro Val 
545 550 555 560 

Pro Pro Thr Gly Asp Ser Glu Ala Thr Pro Val Pro Pro Thr Gly Asp 
565 570 575 

Ser Glu Thr Ala Pro Val Pro Pro Thr Gly Asp Ser Gly Ala Pro Pro 
580 585 590 

Val Pro Pro Thr Gly Asp Ser Gly Ala Pro Pro Val Pro Pro Thr Gly 
595 600 605 

Asp Ser Gly Ala Pro Pro Val Pro Pro Thr Gly Asp Ser Gly Ala Pro 
610 615 620 

Pro Val Pro Pro Thr Gly Asp Ser Gly Ala Pro Pro Val Pro Pro Pro 
625 630 635 640 



<210> 45 

<211> 201 

<212> PRT 

<213> Homo sapiens 

<400> 45 

Met Arg Ala Leu Leu Ala Arg Leu Leu Leu Cys Val Leu Val Val Ser 
1 5 10 15 

Asp Ser Lys Gly Ser Asn Glu Leu His Gln Val Pro Ser Asn Cys Asp 
20 25 30 

Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser Asn Ile 
35 40 45 

His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gln His Cys Glu Ile 
50 55 60 

Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr Arg Gly 
65 70 75 80 

Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp Asn Ser 
85 90 95 

Ala Thr Val Leu Gln Gln Thr Tyr His Ala His Arg Ser Asp Ala Leu 
100 105 110 

Gln Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Glu Val Gly Ala Gln 
115 120 125 

Gly Pro Lys Ala Leu Pro Thr Val Pro Arg Asn Leu Val Thr Ile Pro 
130 135 140 

Phe Ser Gln Arg Ala Gly His Ser Thr Arg Glu Val Gln Pro Leu Val 
145 150 155 160 

Glu Ser Ser Leu Arg Gly Gly Gly Arg Glu Gly Pro Leu Gly Trp Asn 
165 170 175 

Asp Ile Pro Tyr Leu Ser Val Leu Pro Gly Thr Gln Thr Thr Gly Gly 
180 185 190 

Asp Pro Gly Ala Met Cys Arg Trp Ala 



<210> 46 
<211> 74 
<212> PRT 

<213> Homo sapiens 
<400> 46 

Met Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr Asp Gly Arg 
1 5 10 15 

Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu Phe Lys Cys 
20 25 30 

Pro Asp Gly Glu Val Met Lys Lys Asn Met Met Phe Ile Lys Thr Cys 
35 40 45 

Ala Cys His Tyr Asn Cys Pro Gly Asp Asn Asp Ile Phe Glu Ser Leu 
50 55 60 

Tyr Tyr Arg Lys Met Tyr Gly Asp Met Ala 
65 70 



<210> 47 
<211> 166 
<212> PRT 

<213> Homo sapiens 
<400> 47 

Met Thr Ala Ala Ser Met Gly Pro Val Arg Val Ala Phe Val Val Leu 
1 5 10 15 

Leu Ala Leu Cys Ser Arg Pro Ala Val Gly Gln Asn Cys Ser Gly Pro 
20 25 30 

Cys Arg Cys Pro Asp Glu Pro Ala Pro Arg Cys Pro Ala Gly Val Ser 
35 40 45 

Leu Val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gln Leu 
50 55 60 

Gly Glu Leu Cys Thr Glu Arg Asp Pro Cys Asp Pro His Lys Gly Leu 
65 70 75 80 

Phe Cys Asp Phe Gly Ser Pro Ala Asn Arg Lys Ile Gly Val Cys Thr 
85 90 95 

Ala Lys Asp Gly Ala Pro Cys Ile Phe Gly Gly Thr Val Tyr Arg Ser 
100 105 110 

Gly Glu Ser Phe Gln Ser Ser Cys Lys Tyr Gln Cys Thr Cys Leu Asp 
115 120 125 

Gly Ala Val Gly Cys Met Pro Leu Cys Ser Met Asp Val Arg Leu Pro 
130 135 140 

Ser Pro Asp Cys Pro Phe Pro Ser Leu Pro Thr Gly Arg His Val Trp 
145 150 155 160 

Pro Arg Pro A an Tyr Asp 
165 



<210> 48 

<211> 140 

<212> PRT 

<213> Homo sapiens 



<400> 48 

Met Glu Asn Ser Leu Arg Cys Val Trp Val Pro Lys Leu Ala Phe Val 
1 5 10 15 

Leu Phe Gly Ala Ser Leu Leu Ser Ala His Leu Gln Val Thr Gly Phe 
20 25 30 

Gln Ile Lys Ala Phe Thr Ala Leu Arg Phe Leu Ser Glu Pro Ser Asp 
35 40 45 

Ala Val Thr Met Arg Gly Gly Asn Val Leu Leu Asp Cys Ser Ala Glu 
50 55 60 

Ser Asp Arg Gly Val Pro Val Ile Lys Trp Lys Lys Asp Cys Ile 
65 70 75 80 

Leu Ala Leu Gly Met Asp Glu Arg Lys Gln Gln Leu Ser Asn Gly Ser 
85 90 95 

Leu Leu Ile Gln Asn Ile Leu His Ser Arg His His Lys Pro Asp Glu 
100 105 110 

Gly Leu Tyr Gln Cys Glu Ala Ser Leu Gly Asp Ser Gly Ser Ile Ile 
115 120 125 



Ser Arg Thr Ala Lys Val Ala Val Ala Gly Pro Thr 
130 135 140 



<210> 49 
<211> 147 
<212> PRT 

<213> Homo sapiens 
<400> 49 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp 
1 5 10 15 

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln 
20 25 30 

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly 
35 40 45 

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys 
50 55 60 

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp 
65 70 75 80 

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe 
85 90 95 

His Arg Ser His Asn Arg Ser Arg Gly Val Ser His Cys Trp Lys Thr 
100 105 110 

Ala Gly Trp Thr Leu Ala His His Tyr Leu Gln Phe Arg Gly Ser Leu 
115 120 125 

Glu Glu Pro Glu Leu Ser Ser Ala Pro Gly Leu His Gln Asp Leu His 
130 135 140 

Cys Trp Leu 
145 



<210> 50 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<400> 50 

Met Ser Val Lys Glu Thr Leu Pro Leu Ile His Gln Gln Met Tyr Lys 
1 5 10 15 

Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp Ile Arg Phe Val Tyr Thr 
20 25 30 

Pro Ala Met Glu Ser Val Cys Gly Tyr Phe His Arg Ser His Asn Arg 
35 40 45 

Ser Glu Glu Phe Leu Ile Ala Gly Glu Ala Pro Ser Pro Arg Pro Val 
50 55 60 

Pro His Gln Pro Val Pro Gly Ala Arg Pro Ser Asn His Glu Gly Ala 
65 70 75 80 

Arg Leu 



<210> 51 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<400> 51 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp 
1 5 10 15 

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln 
20 25 30 

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly 
35 40 45 

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys 
50 55 60 

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp 
65 70 75 80 

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe 
85 90 95 

His Arg Met Asp Ser Cys Thr Ser Leu Pro Ala Val Ser Trp Leu Pro 
100 105 110 

Gly Thr Ala 
115 



<210> 52 
<211> 143 
<212> PRT 

<213> Homo sapiens 
<400> 52 

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp 
1 5 10 15 

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe 
20 25 30 

His Arg Ser His Asn Arg Ser Glu Glu Phe Leu Ile Ala Gly Lys Leu 
35 40 45 

Glu Asp Gly Leu Leu His Ile Thr Thr Cys Ser Phe Val Ala Pro Trp 
50 55 60 

Asn Ser Leu Ser Leu Ala Gln Arg Arg Gly Phe Thr Lys Thr Tyr Thr 
65 70 75 80 

Val Gly Cys Glu Glu Cys Thr Val Phe Pro Cys Leu Ser Ile Pro Cys 
85 90 95 

Lys Leu Gln Ser Gly Thr His Cys Leu Trp Thr Asp Gln Leu Leu Gln 
100 105 110 

Gly Ser Glu Lys Gly Phe Gln Ser Arg His Leu Ala Cys Leu Pro Arg 
115 120 125 

Glu Pro Gly Leu Cys Thr Trp Gln Ser Leu Arg Ser Gln Ile Ala 
130 135 140 